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Preface 



The Seventh International Symposium on Spatial and Temporal Databases 
(SSTD 2001), held in Redondo Beach, CA, USA, July 12-15, 2001, brought 
together leading researchers and developers in the area of spatial, temporal, and 
spatio-temporal databases to discuss the state of the art in spatial and temporal 
data management and applications, and to understand the challenges and re- 
search directions in the advancing area of data management for moving objects. 

The symposium served as a forum for disseminating research in spatial and 
temporal data management, and for maximizing the interchange of knowledge 
among researchers from the established spatial and temporal database commu- 
nities. The exchange of research ideas and results not only contributes to the 
academic arena, but also benefits the user and commercial communities. 

SSTD 2001 was the seventh in the series of symposia that started in Santa 
Barbara a dozen years ago and has since been held every two years, in Zurich, 
Singapore, Portland (Maine), Berlin, and Hong Kong. By 1999, the series had 
become well established as the premier international forum devoted solely to 
spatial database management, and it was decided to extend the scope of the 
series to also cover temporal database management. This extended scope was 
chosen due, in part, to the increasing importance of research that considers 
spatial and temporal aspects jointly. 

The call for papers attracted 70 submissions, which were submitted by au- 
thors from 18 countries and 5 continents, clearly indicating the international 
nature of the research area and the symposium. The number of submissions 
represents a 30% increase over the previous two symposia. The program com- 
mittee selected 25 research papers and two industrial papers for presentation 
and discussion at the symposium and inclusion in these proceedings. 

The papers in this volume cover diverse aspects of the management of spatial, 
temporal, and spatio-temporal data. We were delighted by the large number of 
high-quality papers in the relatively new area of spatio-temporal databases. In 
particular, aspects of database management for moving objects - including mod- 
eling and querying, data representation, and query processing techniques - were 
covered by papers in four sessions. One session was dedicated to spatial data 
warehousing and mining, which is also a new, emerging topic. As well as these 
contributions, the sessions of the symposium also included excellent contribu- 
tions on important, classical subjects, such as indexing, storage structures, cost 
estimation, and other query processing techniques. In addition to the research 
sessions, an industrial session described developments at two of the dominant 
vendors in the area of the symposium. 

Although not covered in these proceedings, the symposium also featured 
four tutorials, three of which were given by speakers from industry. Each tutorial 
covered emerging technologies in the symposium’s area: mobile e-services, spatial 
and temporal data mining, the emerging Geography Markup Language and other 
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standardization developments, and the current and future spatial features in the 
products of a major database vendor. A panel discussed one of the hot topics in 
the area of spatio-temporal databases, namely that of experimental evaluation of 
query processing techniques. Robert Abarbanel from the Boeing Company gave 
an engaging keynote speech that explored advanced uses of spatio-temporal data 
and consequent research challenges. 

Putting together this symposium was a team effort. Special thanks go to the 
members of the program committee, and the external reviewers they enlisted, 
for their hard and diligent work. As a reflection of their dedication, almost all 
the reviews arrived on time. A discussion phase after the initial reviewing aimed 
to understand and possibly resolve diverging reviews; this phase proceeded ef- 
fortlessly, with each program committee member taking an active part. 

Dieter Pfoser managed the technical aspects of the fully electronic reviewing 
process with dedication; he handled the receiving of submitted papers and their 
subsequent distribution to the reviewers; and he customized and managed the 
software for receiving reviews. Alfred Hofmann at Springer- Verlag lent us his 
professional help. 

The members of the “local” team of organizers at the University of California, 
Riverside, deserve much credit for their essential contribution. Marios Hadjieleft- 
lreriou established and maintained an impressive Web presence for SSTD 2001, 
and Donghui Zhang managed the registration process. Furthermore we would 
like to thank Joyce Aklrtarklravari, Jessica Lin, Eilene Montoya, Dimitris Pa- 
padopoulos, Terri Phonharath, and Michalis Vlachos for their help at various 
stages of the SSTD organization. 

The symposium was sponsored by ESRI and Microsoft, and it was held in 
cooperation with the US National Center for Geographic Information and Anal- 
ysis and the University of California, Riverside. We thank these organizations 
for their support. 
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Jianwen Su*, Haiyan Xu**, and Oscar H. Ibarra* 

Department of Computer Science 
University of California 
Santa Barbara, CA 93106 
USA 

{su,xu, ibarra}@cs .ucsb . edu 



Abstract. In moving object databases, object locations in some multi- 
dimensional space depend on time. Previous work focuses mainly on 
moving object modeling (e.g., using ADTs, temporal logics) and ad hoc 
query optimization. In this paper we investigate logical properties of 
moving objects in connection with queries over such objects using tools 
from differential geometry. In an abstract model, object locations can 
be described as vectors of continuous functions of time. Using this con- 
ceptual model, we examine the logical relationships between moving ob- 
jects, and between moving objects and (stationary) spatial objects in 
the database. We characterize these relationships in terms of position, 
velocity, and acceleration. We show that these fundamental relationships 
can be used to describe natural queries involving time instants and in- 
tervals. Based on this foundation, we develop a concrete data model for 
moving objects which is an extension of linear constraint databases. We 
also present a preliminary version of a logical query language for moving 
object databases. 



1 Introduction 

Existing technology has made it possible to track down movements of target ob- 
jects in the air (e.g., airplanes), on the land (e.g., vehicles, wild animals, people, 
etc.) and ocean (e.g., ships, animals). Among the challenges novel applications 
involving such “moving objects” have brought to software development is the 
problem of data management. In a nutshell, there is a wide range of issues in- 
cluding modeling and representation of moving objects, query language design, 
indexing techniques, query optimization, and even extensions to the well known 
and widely adopted “data independence” principle. Prior work on spatial and/or 
temporal databases is relevant but insufficient for moving objects. The goal of 
this paper is to revisit the foundation of data models and query languages for 

* Supported in part by NSF grants IRI-9700370 and IIS-9817432 

** On a sabbatical leave from Department of Computer Science, Fukoka Institute of 
Technology, Japan 
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moving objects by taking a new perspective from a branch of mathematics — 
differential geometry. 

Management of moving objects has been investigated in the recent years. 
Sistla and Wolfson et al pSWCD97IWXG.I9£j j developed a data model combining 
future temporal logic (FTL) with topological operators. Although FTL allows 
reasoning about temporal properties on individual moving objects, the coupling 
of the spatial and temporal relationships is inevitably loose hindering the reason- 
ing process as well as eventually the query optimization. Coming from spatial 
data modeling, a group of researchers proposed extensions based on abstract 
data types (ADTs) !HGSV9fIFGNSnniGBE+00| . These are practical solutions 
to the modeling and querying problems. Unfortunately, ADTs are too “rigid” 
to express sometimes intimate temporal and spatial configurations of moving 
objects. Indeed, once the operators are fixed, new and sophisticated relation- 
ships involving moving objects may be inexpressible since the “host” language 
(likely SQL) lacks the ability to “look into” the inside of an ADT. On the other 
hand, the focus of the modeling and querying was mostly on the spatial and 
tempo ral rela tionships that can be described in algebraic geometry. Giiting et 
al |GBE + 0fij realized this deficiency and added four operations for computing 
velocity, derivative, turn, and speed as ADT operations. But the underlying rea- 
soning for including these is unclear. They seem to deviate from the well known 
principles of differential geometry |MP77K)ra98i |. Forlizzi et al fF.GNSMl also 
considered moving lines and regions which is not the focus of this paper. 

Constraint databases present an elegant conceptual model for spa- 

tial and temporal data. The primary idea is to use mathematical formulations to 
represent spatial and temporal data and use first order logic to express queries 
Ik l POOI . However, the framework is based on elementary algebra and thus the 
relationships expressible are the ones in algebraic geometry. For example, rea- 
soning about moving speed and direction is out of reach. 

There are other related work on moving objects, but these mostly focus on 
indexing and algorithm issues iK mmmiaai and location management and 
dealing with imprecision |WGD.l97ttWGD + 981W.TS + 99l|. 

In this paper, we consider modeling moving points in some vector space 
and use techniques from differential geometry [ GrahK ; . A focus is to conduct an 
exposition with well known mathematical tools. Through the analysis of logical 
relationships concerning moving points, we conclude that simple primitives of 
velocity and acceleration along with vector operations are sufficient to express 
not only movement-related relationships but also the well studied topological and 
temporal relationships. Based on this finding, we propose an extension to the 
linear constraint database model which allows these simple primitive operations. 
We also develop an extended calculus query language based on some standard 
linear constraint query languages (see |I KLP00 ! ) . We show that this language has 
polynomial time complexity. 

This paper is organized as follows. Section 2 gives some preliminary concepts, 
and Section 3 introduces the abstract model for moving points. Section 4 focuses 
on the relationships of the moving objects. Section 5 presents a design of a 
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concrete data model based on linear constraints and associated query language 
extended from the constraint calculus. Section 6 concludes the paper. 

2 Preliminaries 

In this section we briefly review some of the fundamental concepts related to 
curves in multi-dimensional space and some operations on vectors. These con- 
cepts will be used in the technical discussions throughout the paper. We also 
discuss necessary concepts of spatial databases that we will use in conjunction 
with moving objects. 

We assume the existence of primitive data types such as real numbers and 
time instants. The domains of these types are standard. In our discussions, the 
domain of time has a dense total order isomorphic to the real numbers, i.e., time 
is continuous. For spatial data in the n-dimensional space, we assume the exis- 
tence of spatial types such as region nl line ni point n for each n (> 1). The domains 
of these types are semi- algebraic regions, lines, and points (respectively). Semi- 
algebraic sets can be easily represented by quantifier-free first-order formulas 
using the constraint database approach pCKRQSliKLPOOlj . 

We will model moving objects in such a way that their positions are contin- 
uous functions of time in an appropriate space. We use standard concepts in- 
cluding “differentiable” and “derivative” from differential calculus (cf praSEl )- 
Although these concepts are abstract and mathematical, the techniques from 
constraint databases provide a simple conceptual paradigm and useful techniques 
of manipulating constraint representations of abstract functions. In this respect, 
we will consider function data types such as time — > real , which can be used to 
represent speed of moving objects. 

Let n be a positive integer and n, ..., r„ be n data types. An n-ary vector of 
type Ti x • • • x r n is a tuple (zq, ..., v n ), where Vi is a value in the domain r, for 
each 1 < i < n. 

Our focus is on the moving objects that are points in the n-dimensional (real) 
space for some integer n > 1. For this purpose, we also view the zz-ary vector 
type real x • • • x real as a data type point n for points in the n-dimensional space. 

Let u = (zq,...,zt n ) and v = (zq,...,z; n ) be two n-ary vectors over the real 
numbers. We allow the following standard operations: u + v = (zq + zq, ..., u n + 
v n ), and c • u = (c • zq, ..., c • u n ) where c is a real number. (Note that u — v 
can be defined as u + (—1) • v.) The dot product of two vectors u = (zq, ..., u n ) 
and v = (zq , ... ,v n ) is defined as u • v = A”L 1 zqzq. The (Euclidean) length of a 
vector u, denoted as ||u||, is defined as ^/u ■ u. A unit vector has the length 1. 
It is clear that for each given vector u, one can scale it down to obtain the unit 
vector by jjjjy x u. Unit vectors are useful to represent moving “directions”. 

Unless otherwise specified, we assume that n is the dimension in which spatial 
objects and moving objects exist. 
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3 Moving Objects 



In this section, we present an abstract model of moving objects using vector 
space and related standard methods from differential geometry [IMP77IGral98j 
and highlight some of the useful fundamental concepts and constructs related 
to query languages for moving objects. In the next section, these concepts and 
constructs are used to explore and formulate logical relationships of moving 
objects, which will help in developing query languages. 

Let n be a positive integer. In our abstract model, a “moving point” in the 
n-dimensional space is a function from time to point n that is continuous and 
infinitely differentiable. In applications, moving objects may change directions 
and/or become stationary from time to time. To accommodate such changes, we 
allow a moving point to consist of a finite number of segments of movements, 
where in each segment the function remains infinitely differentiable. 

Let t and t' be two time instants and t < t' and / a function over time. We 
denote the restriction of / over the domain 



Definition. A moving point is a function of type p : time — > point n that can be 
represented as a vector of functions p(t) = (pi (f ) , P 2 (t) > • ■ ■ ,Pn(t)) satisfying the 
following conditions: 



1 . for each 1 < i < n, pi is continuous over time, and 

2. There exist a positive integer m and m time instants ti < t^ < ■ • • < t m 
such that for each 1 < i < n, 

a) Pil(— oo.ti) andpj|p m)00 \ are infinitely differentiable, and 

b) for each 1 < j < to — 1, It: |(t, ,t, + , ) is also infinitely differentiable. 



In the remainder of the paper, we use p(f), q(f), . . . to represent moving 
objects, or just p, q, . . . when it is clear from the context. We also use notations 
such as Pi,qj to mean the i-th and j-tli components of p and q, respectively. 

Let p = (pi, ..-iPu) be a moving point. Clearly (pi, ... ,p n ) defines the position 
of p as a function of time. The velocity of p is defined as the derivative of p: 
vel( p) = p' = (j/| , ...,p' n ). The acceleration of p is the second order derivative of 
p, acc(p) = Oe/(p))'. 

In the following we give some example of using these primitives to express 
some commonly seen properties. These are in fact some basic concepts from 
differential geometry for curves. Let p, q be moving points. 



1 . 



2 . 

3 . 



Moving direction of a moving point at some time instant is the tangent vector 
of the curve at this time instant. The moving direction can be computed as 
the directional derivatives, i.e., vel( p). Sometimes, we scale it to a unit vector 

l vel(p) 

y ||«ei(p)|| ■ 

Speed of a moving point is the distance it travels through in a unit time. This 
is also related to velocity and can be expressed as the length of the velocity: 

IM(p)||. 

Distance between two moving points is the shortest Euclidean distance be- 
tween the two points. This is easily expressible as the length of the difference 
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of two vectors: ||p — q||. If x is a stationary point, the distance of p and x 
is defined in the same way: ||p — x||. Sometimes it is useful to reason about 
distance between a point and a region (or line). If r is a region, the distance 
between p and r is defined as: min xer .{||p — x|| }. 

4. Two points are moving in the same direction. If p, q are two moving points, 
they move in the same direction iff the unit vectors of their moving directions 
are exactly the same. Similar, we can use the moving directions to express 
similar directions of two moving points. 

The focus of this paper is on the query languages for moving object databases. 
The modeling and representation problems for moving objects are studied in this 
context. There also are interesting issues of how to model moving objects from 
the application point of view. For example, linear equations may be appropriate 
to approximate truck movements while quadratic equations are more accurate 
for flight paths. These issues should be addressed for specific domains and are 
not in the scope of this paper. 



4 Logical Relationships of Moving Objects 



In this section, we study logical relationships of moving objects of interest to 
queries over such objects. Previously investigated relationships for moving ob- 
jects can be roughly grouped into two categories: relationships based on time 
instants and on time intervals |SWCD97IWXC.l 981KGSV 99ICRFA00IFGNS00I . 
Clearly, spatial relationships such as topological relations |FF91j and temporal 
relationships are relevant to moving objects. However, these relationships can 
be described in algebraic geometry. Movements of the moving objects bring new 
perspectives of the relationships. Indeed, relationships such as moving speed 
and direction are only natural but also they need differential geometry methods 
IIMP77ICraM that are beyond algebraic geometry. We argue that primitives 
including velocity and acceleration from differential geometry )Cra!)8l are not 
only necessary for expressing relationship concerning movements, but also suffi- 
cient to express spatial and temporal relationships. This provides a foundation 
to develop a data model and a query language for moving objects, which will be 
discussed in the next section. 

To facilitate the discussions, we consider a database with the following classes: 



Flights (airline: string , number: string , position: moving -points) 
Airports (code: string , state: string , location: region 2) 

Runways (airport: string , number: string , landing-direction: real x real) 
Counties (name: string , state: string , area: region) 



Here the relation Flights contains the flight information: the airline name and 
flight number, and the current position of the aircraft. The position data is a 
moving point in the 3-dimensional space. The Airports and Counties relations 
contain only stationary spatial data for airport locations and county regions 
which are regions in the 2-dimensional space. The relation Runways include 
runway orientations specified as unit vectors in the 2-dimensional space. 
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We consider the following queries. 

q\\ Locate all United flights over Ventura at 4pm, January 29, 2000. 

< 72 : List all pairs of flights within 300 miles of the San Francisco Airport right 
now that are moving in opposite directions and both are increasing speed. 
< 73 : Report all flights approaching LAX within 45 degrees of the orientation of 
runway 3 that are below 20,000 feet. (This may be needed for rescheduling 
flight landings due to a runway change.) 

< 74 : Find all United flights that entered Santa Barbara (SB) county during time 
ti to t2- 

< 75 : Show all flights that have been flying above LA county for the last 15 minutes. 

Although the queries and the databases seem slightly artificial, it is not difficult 
to imagine that such temporal-spatial relationships could easily occur in queries 
in various applications. Our focus is to develop appropriate model and query lan- 
guages through the study of the logical relationships needed in querying moving 
object databases. 

If we take a close look at the above queries, <71 is a spatial selection (range 
intersection) along the longitude and latitude for a given a time instant (snap- 
shot). This indicates that the topological, spatial, and temporal relationships in 
the traditional temporal-spatial databases remain relevant to querying moving 
object databases. However, the movements of the objects introduce new types 
of relationships. For example, it should allow computation of moving speed from 
which distance and time information can be derived. Also, moving direction is 
important and useful. They are exhibited in queries <72 and < 73 . Query <74 includes 
a property “enter” which is different from the intersection predicate in the tra- 
ditional spatial relationships. In fact, the predicate does not make much sense 
for static objects but is very useful for moving objects. For query < 75 , one way to 
express the property is to first extract the trajectories of all flights in the speci- 
fied time duration and followed by a spatial containment query. An alternative 
is to state directly that these flights indeed stayed in the area during the last 15 
minutes. The latter is easier to understand and perhaps more desirable. 

The functionality of a DBMS goes far beyond query processing. Similarly, 
a conceptual data model should not only include data representation tools but 
more importantly support data manipulation languages which are easy to use 
and whose queries can be optimized. To reach this goal, we have to understand 
logical relationships for moving objects and the goal to develop simple and fa- 
miliar yet expressive primitives that can facilitate query optimization. 

The 9-intersection model for topological relations IkPTT| simplifies the topo- 
logical properties. It would be desirable to develop similar simple models for 
moving object relationships. For example, we can classify binary moving object 
relationships in the following three dimensions: between moving objects or mov- 
ing and stationary objects, involving time instant or time interval, and related 
to position, velocity, or acceleration (Fig. [I] below). 

Fig.Q shows some relationships in some of the categories. For example, the 
predicate “in” refers to a moving object is inside a region at a time instant. This 
can be used for query < 74 . “Enter” states that the object intersects the boundary 
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Fig. 1. Three dimensional view of moving object relationships 



of a region in the direction from outside to inside; the latter part is related to the 
velocity. Query q 4 will need this. “Opposite direction” (“aim-at”) means that two 
objects are moving toward each other (respectively the object is moving toward 
a region) at an instant, which can be used for < 72 - “Collision” means that two 
objects are in the same position at an instant. “Collision course” refers to objects 
on a collision course for a period of time. “Meet” (or “land”) is a refined version 
that requires the objects’ speed to be the same (or 0 , resp.) and acceleration to 
be either positive for the slower object or negative for the faster object, or both. 
“Approaching” specifies the changes in speed and direction toward collision for 
a time period. 

Clearly, there are many more relationships that one can formulate among 
moving objects. It is thus interesting to identify “fundamental” or “primitive” 
ones. 

Mathematics provides tools for conceptualization and modeling in computer 
science and especially in databases. Indeed, much of the development and study 
of the relational languages has been based on first order logic (see AH \ *)~> ) . In 
spatial databases, elementary geometry techniques were the basis for characteriz- 
ing topological relationships (e.g., IEF91|Ege91p . The introduction of constraint 
databases brought algebraic geometry into the database context H\ 1.1 * 1)0 , where 
more complex topological properties can be analyzed using real variables and 
elementary algebra. Representative work includes p?)V9i^KPd B 97tK d B 9 9| . It 
is little surprising that this long tradition would continue. 

Differential geometry combines (differential) calculus with geometry (see 
[IMPTTlCraOSj l. The use of calculus allows the capture of much finer geometric 
properties. Take curves as an example, velocity of a curve (i.e. , moving point) 
is simply its (directional) derivative and the derivative of the velocity gives the 
acceleration. Other important properties such as curvature and torsion can then 
be expressed. In fact, the Fundamental Theorem of Curves states that each curve 
is uniquely determined by its curvature and torsion (modulo position) IMP77I . 

Once a time instant is fixed, moving objects are simply points in the space. 
Therefore, all spatial relationships are relevant and needed. In addition, the time 
dimension explicitly exists for moving objects and temporal relationships should 
also be included. Although the combination of temporal and spatial relationships 
can help to express “entering a region”, they are nevertheless insufficient for 
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reasoning about the moving directions and speed as well as their changes. It is 
the weakness inherited from the underlying mathematical model of elementary 
geometry. The weakness was also observed in | IGBE + 00| . Although the model 
of |IGBE + 00| for moving object uses ADTs with operation including derivatives, 
their approach of simply adding missing and needed operations makes their 
language cumbersome and very difficult to use. The consequences are that the 
language is difficult to analyze and query optimization is hard or even impossible. 
We believe that the difficulty comes really from their ADT approach, and not 
from the moving objects themselves. A necessary step towards an elegant simple 
query language is to analyze the fundamental relationships of moving objects. 

For the focus of our exposition on logical relationships of moving objects, 
we consider velocity and acceleration of moving points represented in the vector 
space, along with the usual operations on vectors. Thus in our model, we provide: 

1. Vectors (vector space) for representing moving points and vector additions 
and multiplications by real numbers. 

2. If p is a vector of a moving point, then the velocity vel(p), acceleration 
acc(p), length ||p||, and moving direction dir( p) = |i^eZ(p)|| • 

Although the operations listed above are very simple, we show that many 
interesting logical relationships including those we listed in Fig.^are expressible 
in terms of these. Intuitively, distance can be obtained from vector length. With 
distance one can easily express topological relationships in the sense of IEF01I . 
Moving direction information is derivable from velocity. Moving direction can 
help to express some temporal relationships between moving objects. Our model 
also allows the primitives to be combined using arithmetic constraints (in the 
spirit of differential calculus) and logical connectives. 

Although according to the Fundamental Theorem of Curves, curvature and 
torsion are key properties in characterizing curves, we do not include them as 
primitives. This is primarily due to our focus on queries in the present paper. 
We believe that curvature and torsion are important and can be useful in things 
such as comparing and querying trajectories. This will be left for future work 
since our focus of this paper is mainly on the motions. 

In the remainder of this section, we give a few example relationships and show 
how they can be expressed. Although these are mostly standard in differential 
calculus |Cra98j. we include them here for exhibiting the intimate relationships 
between moving object modeling/querying and differential geometry techniques. 

We first look at some distance relationships. Given two moving points p and 
q, their distance at time t can be expressed as ||p(t) — q(t) || , which we denote 
as dist(p,q, t). The following are some relationships concerning distance: 

— distance Jessthan(p,q,t,d) = (dist(p,q,t) < d). 

— in(p,r, t) = (p(t) inside 7'), where r is a region and “inside” is a spatial 
predicate. 

— collision (p,q,t) = (p (t) =q (t)). 

— catching -up(p,q,ti,t 2 ) = (Vtt' {t\<t<t' <£ 2 ) — ► dist(p,q,t)>dist(p,q,t')). 
Similarly “stays _in” can be expressed. 
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For moving direction related relationships, velocity is needed. 

— opposite -direction (p,q, i) = (dir(p)(t) + dir(q)(t) = 0) and also 
same-direction(p.q,t) = (dir(p)(t) = dir(q)(t)). 

— on.collision-course(p, q, t\, t^) = 

(\/t(ti < t < t 2 — > oppo site -direction t)) A3t(t 2 < t A collision(p, q, t))). 

— aim-at(p,r,t) = (3x(x inside r A («rat(x — p(t)) = dzr(p)(f)))), where 
wmt(x) converts x to a unit vector. 

— enter(p,r,t) = (onJine(p, boundary(r), t) A 3t' (1/ < t AVt" (t r < t" < t — > 
- | (p (f) inside r)))) where “boundary” returns the boundary lines of a region 
r. The formula states that just before p moves across the boundary of r, p 
is always outside of r. Note that this expression uses interval property and 
not velocity. It can also be expressed with the moving direction from the 
velocity of p. 

Finally, in some cases acceleration is needed. For example, land(p,x.,t) = 
dist(p,x,t) = 0 A ||ve/(p)|| = OAacc(p) < 0, where the last condition states that 
the velocity slows down along all coordinates. 



5 A Linear Model for Moving Objects 



In this section, we introduce a new data model and a query language for moving 
objects. Unlike the earlier models based on ADTs {En^WQFGNSODlGBE+OO) 
and temporal logic pWCD9Yj , the new model combines the linear constraint 
techniques from constraint databases IKIUOOI with the operations and relation- 
ships we studied in the previous section. We show that queries in the language 
can be evaluated efficiently in polynomial time. 

Constraint databases |KKR.95i| are particularly suitable for conceptual mod- 
eling and manipulation of multi-dimensional data IkLMH . Central to constraint 
databases are mathematical formulas with a prescribed semantics (a.k.a. con- 
straints) 0 Constraint database techniques provide a fundamentally sound ap- 
proach to spatial and temporal database applications. The elegance of constraint 
models lies in that they allow spatio-temporal data to be viewed conceptually as 
mathematical objects in algebraic geometry. The important data independence 
principle can be very naturally supported. 

Constraints can be integrated into relational or object-oriented structures 
to suit applications. For example, the data model using conjunctions of linear 
constraints as values that can then be used as values in a classical relations 
was adopted in the DEDALE system jGRS98) . the constraint query and update 
languages of | !BB(”i98ll . and the COSMOS project |K11SS98| . With this model, 
one can represent information such as follows. Consider the relation Counties 
which contains the names, states, and regions of counties. Here region data are 
spatial, representing the administrative 



This is not to be confused with integrity constraints. 
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areas. We can represent the information by a relation Counties shown above. 
The attribute area has values that are 2-dimensional regions in the (real) plane. 
In a linear constraint database such regions are represented by mathemati- 
cal objects using boolean combinations of linear equations and inequalities. 
For example, region^ is represented as a formula ip(xi,x<z) = 68<a;i<70 A 
z 2 <2 A 70<:ti-|-8:e2i where a point (aq,^) is in region v iff (p(x\,X 2 ) is true. 

Linear constraints over the real numbers may involve equality and order 
predicates (=, <, <, >, >) and addition (+). The following are examples of linear 
constraints: Ef =1 aiXi = ao, LJf =1 aiXi < oo, where aq’ s are variables and oq’s are 
real (or rational for the linear casqj) numbers. 

We now define our linear constraint model for moving objects. Intuitively, 
we use linear constraints to represent the functions from time to points in space. 
Since time and space are over real numbers, this means that the linear constraints 
are over a time variable t and n dimension variables, aq, ...,x n . 

Definition. A moving point p has a linear constraint representation if there 
exist a positive integer m and real numbers ai,...,a m satisfying the following 
conditions. 

1. a i < 02 < • • ■ < a m , and 

2. let ag be a symbol representing — oo and a m +i representing +oo (for technical 
convenience), and for each 1 < i < n, the function Pi{t) is represented by 
the following constraints: 

m 

Pi(t ) — ( Xi — bijt -(- Cjj A Oj A f A 1 ) (1) 

3=0 

where 6^’s and c^ ’s are real numbers and for each 1 < j < m and each 
1 < * < n, 

bij — 1 Oj -(- Cij — 1 bijO>j T Cij (2) 

In the above definition of a linear constraint representation of a moving point 
p, each coordinate pi of p is a linear function of the time variable t (equation H), 
and p may change speed and direction at time instants oq, ..., a m . The condition 
(0 in the definition is to ensure that the coordinate functions are continuous 
at these time instants. Clearly linear functions are infinitely differentiable. In 
fact, their derivatives are constants and second or higher order derivatives are 
all zeros. 

2 The first order theory of rational numbers with multiplication is undecidable. 
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Example 5.1 Consider a tuple in the relation Flights where the moving point 
values of positions are in the 3-dimensional space. A part of the position value 
of the tuple is shown as follows: 

xi=2t— 40 A 0<t<21 V a?i=2 A 21<f<22 V *i=|t-9 A 22<t<47 
x 2 = - t+23 A 0<t<21 V * 2 = - t+ 23 A 21<t<22 V *2=1 A 22<f<47 
*3=30 A 0<t<21 V * 3 = - 5t+135 A 21<t<22 \J x 3 = - f+47 A 22<t<47 

The airplane described above moved towards southeast first, turned at time 21 
(and position (2,2,30)) and also started to descend, and made another turn at 
time 22 (and position (2, 1,25)) and continued to descend until landed at time 
47 (and position (14.5,1,0)). I 

Example 5.2 Continue with Example rm The velocities of the airplane were 
(2, —1,0) from time 0 to 21, (0,-1,— 5) during the interval (21,22), (1,0,— 1) 
during the final landing phase. The speeds were y/5, v^, y 5 , respectively. I 

In the linear representation case, clearly the velocity is always constant and 
the acceleration remains 0. Therefore the acceleration primitive is unnecessary. 
In our model, we will have vectors and the usual operations, vel, dir , and || • |. 

Using the above linear constraint moving points as a primitive type, we can 
build a data model for moving points in a systematic way. We use the relational 
model to demonstrate the approach, although it is as easy to use an object 
oriented model. 

An ADT encapsulates the internal structure of a data type and thus provides 
a clear interface by a set of operations on the data type. This is appropriate 
for some applications in databases such as to deal with binary large objects. 
Although the ADT approach may be appropriate in some applications, we believe 
that it is not an ideal approach for spatial and temporal databases. This is due to 
the need to distinguish different spatial data types, and the intimate relationships 
which exist between them. On one hand, modeling all temporal-spatial types 
using a single ADT over-simplifies the data structure and loses the flexibility of 
doing spatial and temporal reasoning. On the other hand, if we treat different 
temporal-spatial types as different ADTs we are forcing another “rigid” design 
and spatial and temporal reasoning is equally hard. 

In any case, ADTs are limited by the operations they provide. If the queries 
and update operations are completely predictable, ADTs can be developed. For 
example, it would be impossible to express some logical relationships between the 
elevations of two objects had elevation not been an ADT operation. Therefore, 
in cases ad hoc queries are to be supported, one has to provide a logic language 
that is capable of combining ADT operations beyond sequential compositions. 
An interesting question here is then what is the minimum set of basic ADT 
operations needed since some operations can be expressed by combining the 
basic operations. 

We believe the temporal-spatial data types should encapsulate implementa- 
tion details as ADTs do but they should expose a “conceptual structure” which 
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can be used for expressing temporal spatial relationships. The latter differs from 
the spirit of ADTs0 We propose the notion of a “logical data type” (LDT). 
Intuitively, an LDT in our model provides a relation schema for representing the 
data inside the values and the data can be accessed directly outside of the LDT 
and be reasoned about. We give the following technical notion. 

Definition. Let r = time — > real be a function of type from time to real num- 
bers and n a positive integer. The n-ary logical vector type (LV type), denoted 
as P n , is the type r n = r x • • • x r. The domain of P n is the set of all moving 
points having linear constraint representations. 

Although P n resembles syntactically a product type of n function types from 
time instants to real numbers, the semantics (domain) is different. We use P n 
to represent moving points in n-dimensional space in this paper, where each 
function gives the mapping from time to a coordinate value. 

A relation schema is a finite set of pairs (A, T), where A is an attribute name 
and T a type or LV type such that the attribute names are distinct. A tuple over 
a relation schema R is a total mapping from attribute names in R to elements in 
the domains of the respective types or LV types. A relation instance of a relation 
schema R is a finite set of tuples over R. A database schema is a finite set D of 
relation schemas and a database instance of D is a total mapping I from D such 
that for each R in D , I(R) is a relation instance of R. 

We now consider query languages for our model of moving objects. We start 
with a discussion on constraint query languages for constraint databases. 

The relational calculus was extended to a constraint query language IKK MSI . 
In the traditional constraint database context, a first-order formula ip(x) with 
free variables x defines a query Q in the following sense: if J is a constraint 
database, the answer to Q on I is defined as 



Q(I) = {a | the database I satisfies y>(a)}. 



A key point here is that the answer Q(I ) may be an infinite relation but should 
be definable as a constraint relation. 

It is not obvious that the first-order logic defines a query language for con- 
straint databases. The key “ingredient” is the quantifier elimination property 
of the first-order theory of real closed fields. A logical theory admits quantifier 
elimination if every formula is equivalent to a quantifier- free formula. The fact 
that the theory of real closed fields admits quantifier elimination was first dis- 
covered by Tarski jTa.rbl j . There are tractable algorithms to perform quantifier 
elimination for a fixed number of distinct variables [RKR861Col751Ren92lj . This 

to design the constraint database framework and 



property was used in 
is necessary [GESZi. 

In our model, a relation has a finite set of tuples. Tuples provide some de- 
scriptive and discrete information, while the spatial regions are represented as 



3 In theory, one could argue that ADTs could provide conceptual structures but such 
ADTs can’t really support optimization of sequences of operation on the ADTs. 
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the same way as in constraint relations (e.g., |K RSSf)8] h and moving points are 
represented as values in LV types. 

We can extend the relational calculus (or a constraint query language) for 
moving objects in our model. Specifically, we allow real variables (represent- 
ing time and spatial coordinates) to access the data inside values of LV types, 
linear arithmetic constraints on these real variables, and first order construc- 
tions (V, A, -i, V, 3) on the constraints. The language design needs to be carefully 
crafted to incorporate LV types and associated operations and, more impor- 
tantly, to ensure query results to always be relations (i.e., finite sets of tuples 
with values in the domains of types and LV types). Although this is not techni- 
cally difficult, it is not a trivial task. 

The terms in our calculus include variables (with associated types or LV 
types), values in the domain of types or LV types, addition of two terms of type 
real (or time), multiplication of a real (or time) term by a real number, and also 
the following. 

— If p, q are terms of type P n and c a term of type real , then p + q, c • p are 
also terms of type P n . 

— If x is a vector, unitpx.) is to convert it into a unit vector with the same 
direction. 

— If p is a term of type P n , vel( p), dir( p) are terms of type P n and len( p) is 
a term of type real. 

Since a value of type an n-ary vector of real types can also be viewed as a value 
of type P n (with all constant functions). We naturally extend the operations on 
LV types to include vectors of reals. 

Let p be a value in the domain of P ni a be a time instant, and 6 1; ...,b n be 
n real numbers. The expression “p(a; b ±, ..., &„)” is true if the moving point p is 
at the position (foi , ...,6„) at the time instant a. 

The formulas in our query language include the following. 

— x = y if x, y are terms of the same type, or x < y if x, y are real or time 
terms, 

— p(t; X \, ..., x n ) if p is a term of type P n , t a time term, and x \, ..., x n are real 
terms. 

— R(xi, ..., 2 fc) if I? is a relation schema in the database and xp s are terms of 
the respective types. 

— —up, ip Aip, ipVip, 3x<p, Vxtp are formulas if <p, ip are formulas and a; is a variable. 

A query is an expression of form “{(xi, ...,Xk) \ </?}” where <p is a formula with 
all free variables in X \, ..., Xf-- 

We illustrate our query language through the example queries listed in Sec- 
tion 4. We use the first two letters to abbreviate relation names. 

1. The query qi can be expressed as follows. 



j(n,xi,x 2 ,x 3 ) 3p3r 



Fl(UA, n, p) A p(t 0 ; x\, x 2 , x 3 ) A 
Co(Ventura, CA, r) A r(x i,x 2 ) 
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Here to is the value for the time instant “4pm on January 19, 2000”. Also, r 
is a 2-dimensional region and the expression “r(x i,x 2 )” is true if the point 
(xi, X 2 ) is in r. Note that this is a conjunctive query and directly corresponds 
to the English version. 

2. Query < 72 , since all moving points in linear representation have no accelera- 
tion, can be expressed as follows: 

{(ni, 712 ) | 3pqxiX 2 rp¥'L{x\ 1 ni 1 p) A Fl(:T 2 , n 2 , q) A Ai(SFO, CA, r) A 
dir{ p)(W) + dir(q)(t now ) = 0 A r{jp) A 
3yi2/22/3P(t now 5 2 /i,y 2 , 2 / 3 ) A len{p-(y 1 ,y 2 )) < 300 A 

3ziZ 2 z 3 q(t now ; zi, z 2 , z 3 ) A len{p—(z\, z 2 )) < 300} 

Here f now stands for the current time. The first line of the expression finds 
the locations of two flights and the airport region. The second line ensures 
the moving directions to be opposite to each other. The third and fourth 
lines are testing the distances of the flights to the airport. 

3. For query q 3 , the condition on the approaching angle is expressed by ex- 
amining the direction vector of the flight projecting to the plane and the 
orientation vector of the runway. After the former is converted into a unit 
vector, the angle condition can be expressed as an equivalent condition on 
the distance. The query expression is shown below. 

{(*, n) | 3p ly!y 2 y 3 Fl(x, n, p) A Ru(LAX, 3, l ) A dir(p){t now ; y u y 2 , y 3 ) A 
len(unit(yi , y 2 ) — l) < \/2 — \[2 A y 3 < 20000} 

4. In query < 74 , the interesting condition is “enter”. Note that it is expressed 
naturally as in differential calculus. The flight enters the county at time t if 
the flight is not above the county at every time instant just before t: 

{(n) | 3p tr Fl(UA, n, p) A Co(SB, CA, r) A t\ < t < t 2 A ip(r, p (t)) A 
3t'{t' <tA\/t"{t' <t" <t^^{r,p(t"))))} 

where ip(r, p(t)) is a formula which test if at time t the moving point p is in 
the (2-dimensional) region r. The formula ip is similar to the corresponding 
part of the formula in query q\ . 

5. The query q§ expresses the condition that the flight stays in a region by 
explicitly stating that for every time instant it is in the region. 

{(a;, n) \ 3pr Fl(cc, n, p) A Co(LA, CA, r) A 
V£ (t now 15 < t < 

^now * 3xiX 2 X3(p(t; xi,x 2 ,x 3 ) A r(xi,x 2 )))} 

Note that this query states a property which should be held for a period of 
time. It can also be expressed with temporal logic but the temporal 

operators are almost disjoint from expressions for spatial relationships and 
the expressions are less intuitive. 

We briefly discuss evaluation of queries in our query language. Basically, 
queries in our language can be translated into the constraint query language of 
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fK K K95| augmented with relational attributes. Since the language of f KKR95l 
contains multiplications, it is easy to see that it gives an effective evaluation 
algorithm. Thus we have: 

Theorem 5.3 Each calculus query can be evaluated in polynomial time (in 
terms of the database size). 

There are several issues concerning the language and its semantics: 

1. Consider query q\ above. It pulls coordinate values out from inside a moving 
point and returns as results. Such a use may cause some query results to be 
infinite, i.e. , “unsafe”. This is not a problem if we extend the definition of 
relations to be finitely representable relations jCS97| . Another possibility is 
to return the entire moving point value as a value of LV type. 

2. Length function computes the length of a vector. Clearly length is not lin- 
ear. Uncontrolled use of the length operator would cause results to contain 
non-linear equations. One way to overcome this, is to treat length as an ag- 
gregate function and to limit the syntax to only allow “staged” evaluation 
of aggregate functions. This will guarantee the output to be representable in 
linear constraints, similar to rsm . 

6 Conclusions 

In this paper, we present a model for moving objects using techniques from 
differential geometry. By studying the relationships related to moving objects 
and queries over them, we show that primitives of velocity and acceleration from 
differential geometry are expressive enough for topological, spatial, and temporal 
relationships, as well as for relationships for movements. We also developed a 
concrete data model based on linear constraint databases and a query language 
for moving objects. 

Although our work is very preliminary, the differential geometry tools cer- 
tainly yield a fresh look at the modeling and language issues for moving objects. 
There are many issues remaining. One issue is to fine tune the syntax of con- 
straint query formulas to guarantee the closure property. It is also interesting 
to study the relationships between trajectories and in this case, it appears that 
curvature and torsion are useful. Introducing aggregate functions is yet another 
issue. An important aspect is to deal with integrals that is left out from our 
current model. At a more fundamental level, efficient algorithms and query opti- 
mization are the key issues to the eventual success of any models and languages. 
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Abstract. Moving object databases are becoming more popular due to 
the increasing number of application domains that deal with moving 
entities and need to pose queries. So far implementations of such 
systems have been rather weak and certainly not at industrial strength 
level. In this paper we define a concise data model and a set of 
powerful query predicates for moving objects. Moreover, we propose 
an implementation design based on off-the-shelf industrial solutions 
enhancing thus the applicability and robustness of our approach. 

Keywords: Spatiotemporal models, query language, data types. 

1 Introduction 

There is a wide variety of applications that convey or manipulate objects that change 
their spatial features through time. Examples are: GIS applications involving time (i.e. 
evolution of regions over time, etc.), and real time applications for vehicle location 
sensing (cars, trains, aircraft or vessels monitoring etc). Another category of evolving 
such applications with intensive spatiotemporal content and dependencies are: inter- 
active multimedia documents, virtual reality applications, 3D animations, etc. The 
aforementioned application domains currently lack database support. It is a challenge 
to design a model that integrates the representational primitives in these domains and 
covers the requirements of the traditional spatial applications (i.e. GIS, enriched with 
time aspects) as well as of the new kind of applications mentioned above. 

Such models will enable storage of application objects as structured entities and then 
enable queries based on their spatiotemporal stmcture. In this context we envisage objects 
that move/change through time. These objects have a spatial extent that may change over 
time. From the extent of the object we may derive its position and, if applicable, its orien- 
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tation. Then we would be interested in relationships among such objects. The relationships 
have spatial and/or temporal aspects as well as causal ones (i.e. the causes of the relation- 
ship, not only the resulting image i.e, of two overlapping objects). 

This illustrates the need for capturing these aspects of the objects in the database. A 
problem with the recently proposed approaches [5] is that their feasibility for imple- 
mentation is low as they provide a very large set of data types and attached operations. 

As a concrete example, fleet management involves the storage and querying of 
spatiotemporal objects. Assume a taxi service that needs to know continuously the 
location of taxi cubs and optimize their service. Potential queries include the 
following: 

Which is the closest taxi to a specific address? 

Which taxis will be within 5 km from a clients’ address in the next 10 
minutes? 

Have the trajectories of taxis A and B intersected in the previous 2 hours? 
Assuming we know the anticipated trajectories, will taxis A and B be closer 
than 2 km to each other in the next 30 minutes? 

Such queries are quite cumbersome to express in current commercial database 
products that handle spatial/temporal data. Moreover the co-existence of spatial and 
temporal attributes in the system, as well as their interdependencies make the query 
processing issue a challenging problem. 

In this paper we restrict the discussion to moving point objects, i.e. objects with 
zero extent, that change their position continuously over a road network. Such objects 
are called moving objects for short, they are pervasive, but in contrast to the discretely 
changing objects, they are much more difficult to accommodate in a database. 
Supporting these kinds of moving objects by extending existing database technologies 
is exactly the challenge addressed by this paper. 

What is needed in current real-world spatiotemporal applications is a small and 
robust set of predicates with high expressive power, suitable for realistic 
implementation based on off the shelf DBMS technology. The implementation 
solution should cover query processing schemes for the predicates and the 
accompanying indexing schemes. In this paper we introduce a model for moving 
point objects based on the underlying road network, as given by GDT maps [6], We 
define a small set of powerful predicates that can effectively be implemented in terms 
of existing spatial operators in a commercial database/GIS environment [9], and 
provide reasonable expressive power (see Appendix A). 

2 The Underlying Model 

Hereafter we introduce the model that serves as the basis for our spatiotemporal query 
system. The initial information is a map, represented as a relation. Each tuple in the 
relation represents a city block, i.e. the road section in between 2 intersections, with 
the following attributes: 

Polyline : the block polyline given by a sequence of 2D x,y coordinates: 
(xl,yl),(x2,y2),...,(xn,yn). Usually the block is a straight line, i.e. given by two 
(x,y) coordinates. 

Fid : The block id number 

The following attributes are used for geocoding, i.e. translating between an (x,y) 
coordinate and an address such as "1030 North State St.": 
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LJjdd: Left side from street number 

L_t_add : Left side to street number 

R_f_add\ Right side from street number 

R_t_add : Right side to street number 

Name', street name 

Type: ST or AVE 

Zipl: Left side Zip code 

Zipr: Right side Zip code 

Speed : speed limit on this city block 

Oneway: a Boolean Oneway flag 

The following attributes are used for computing travel-time and travel-distance. 
Meters: length of the block in meters 

Drive Time: typical drive time from one end of the block to the other, in minutes 

This format corresponds to maps provided by the GDT Co. [6]. An intersection of 
two streets is the endpoint of the four block-polylines. Thus each map is an undirected 
graph, with the tuples representing edges of the graph. The map can be used as the 
underlying layer for locating moving objects (e.g. taxis). An important concept here is 
the trajectory of a moving object, which, intuitively, gives the route of the object 
along with the time at which the object will be at each point on the route. 

Moving Object. The route of a moving object O is specified by giving the starting 
address or (x,y) coordinate ( start joint), the starting time and the ending address or 
(x,y) coordinate ( end _point). An external routine available in most existing 
Geographic Information Systems, and which we assume is given a priori, computes 
the shortest cost (distance or travel-time) path in the map graph. This path denoted 
P(O) is given as a sequence of blocks (edges), i.e. tuples of the map. 

Trajectory/Route. Since P(O) is a path in the map graph, the endpoint of one block 
polyline is the beginning point of the next block polyline. Thus the whole route 
represented by P(O) is a polyline denoted L(O). For the purpose of processing 
spatiotemporal range queries in the language we define below, the only relevant 
attributes of the tuples in P(O) are Polyline and Drive-Time, 

Given that the trip has a starting time, for each straight line segement on L(O) we 
compute the time at which the object O will arrive to the point at the beginning of the 
segment, and the distance of the point from the start of the route. 

The trajectory of an object O, denoted T(O), is a relation with attributes 
SEQUENCE#, x, y, t, b. A tuple [i, (x,y), ti, b] in this relation indicates that (x,y) is 
the i'th intermediate point on O's route L(O), and O will be there at time ti. The time ti 
is computed based on ti- 1 and the drive-time parameter for the block. In other words, 
the map attributes Drive Time and Polyline define an average speed for each block, 
and we use the length of the line segment and this average speed in order to compute 
each ti. Furthermore, the time at which the object is at any intermediate point on the 
line segment is computed by linear interpolation. In other words, a trajectory is a 
piece-wise linear function in 3D. 
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Fig. 1 . A moving point representation 



A trajectory may represent more than one trip of the same object. If so, the first 
tuple of the second trip indicates, using the Boolean attribute b, that the line segment 
between the endpoint of the first trip and the begin-point of the second trip is not part 
of any route of the object. Namely, if b is False in the i'th tuple of the trajectory, then 
the whereabouts of the object between ti-1 and ti are unknown. Otherwise, i.e. if b is 
True, then the object is " alive " between the two time points, and its location in the 
time interval is computed by interpolation. 

Next we define the type trajectory in terms of SQL DDL statements [11]: 

CREATE TYPE trajectory AS OBJECT 

(SEQUENCE# integer, x integer, y integer, t real, b boolean); 

The MOVING-OBJECTS relation has additional attributes such as COLOR, 
WEIGHT, DRIVER, etc. Then the moving objects (M_0) relation can be defined as: 

CREATE TYPE M_0 AS OBJECT 

(OBJECT_id integer, T trajectory, color integer, weight integer, driver person_id); 

On the attribute T we use a 3D spatial index, i.e. a spatiotemporal index on 
2Dspace+time [16,18]. The query for selecting the trajectory of an object O is: 

Ql: SELECT M_O.T FROM M_0 WHERE OBJECT_id = O; 

Sometimes, for brevity we will use the term id, instead of OBJECT_id. 
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We also need a function that returns the distance traveled along the route of an 
object between two points that belong to the route. Since the route of the object is 
defined as a polyline P, such a function is defined as: route-distance (Polyline P, 
2D_point pi, 2D_point p2). There is an ordering between pi and p2, i.e. pi is visited 
before p2 along the route. Similarly we define the function route-travel-time . Both 
functions will be registered as a User Defined Functions (UDFs) [9] attached to the 
M_0 type. 

2.1 Definition of the Extended Functionality 

The problem we address in this paper is two fold: i. the definition of a concise set of 
data types for moving objects and ii. the design of a small but powerful set of query 
predicates. The data types have been defined in the previous section. Here we will 
define the query predicates in terms of SQL extensions. The definition of the query 
predicates embedded in an SQL statement follows: 

SELECT LOC(id, t) | WHEN_AT(id, location-L) | 

<other attributes of T or moving objects relation> 

FROM T 

WHERE id WITHIN (DISTANCE s | TRAVELTIME t) FROM R 

[(ALONG EXISTING PATH) | (ALONG SHORTEST PATH) ] 

[(ALWAYS BETWEEN) | (SOMETIMES BETWEEN) starttime AND 

endtime] 

where: 

LOC(id, t) is the location on the map of a moving object identified by id, at a 
specific time t. The predicate returns a 2D point corresponding to the location of 
the object. 

WHEN_AT (id, location-L) returns the time at which the moving object 
identified by id was at the location location-L. A set of time values is returned. 
The cardinality of the result is 0 if the object never passed though location-L, 1 if 
it passed once, and more than one if it has passed there several times. 

- WITHIN (DISTANCE s | TRAVELTIME t) FROM R is a predicate that requires 
one of the two operands: DISTANCE s or TRAVELTIME t, where s and t are 
real numbers. R is a point (given by its coordinates) on one of the line segments 
of the map, and can be obtained, for example, by geocoding an address. The 
predicate returns True when either: i. the object needs to travel less than s 
distance units in order to arrive to R or ii. the object needs to travel less than t 
time units in order to arrive to R. This predicate is further refined by the 
following two groups of quantifiers: 

a. ALONG EXISTING PATH and ALONG SHORTEST PATH. They are 
optional mutually exclusive quantifiers. The former requires the WITHIN 
criterion to hold along the route of the object (which implies that R belongs to 
the route of the object). Intuitively, this means that the object can reach R 
within the cost metric while traveling along its existing path. The second 
quantifier requires that the WITHIN criterion holds when following the 
shortest path between the object's current location and R. If none of them is 
present in the query ALONG EXISTING PATH is used as the default value. 

b. ALWAYS BETWEEN and SOMETIMES BETWEEN starttime AND 
endtime. These optional quantifiers relate to the temporal duration of the 
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WITHIN criterion. The former requires that the WITHIN criterion is true for 
all times between the starttime and endtime values, while the latter requires 
that the WITHIN criterion is true for some times between starttime and 
endtime. If none of them is present in the query ALWAYS BETWEEN is used 
as the default value, and the starttime and endtime are assigned the starttime 
and endtime of the trip respectively. 

As is evident, the WHERE clause may have regular SQL atomic conditions on 
regular database attributes (i.e. other than trajectory). 

Hereafter we elaborate on the combined semantics of the WITHIN predicate 
depending on the values of the route related (1. ALONG EXISTING PATH or 2. 
ALONG SHORTEST PATH) and time related quantifiers (A. ALWAYS BETWEEN 
or B. SOMETIME BETWEEN). 

1A: the condition is satisfied for object if every point on o's route between starttime 
and endtime is within route-distance s or route-travel-time t from R (observe that 
using the GDT map format one can compute both travel time and distance as different 
cost measures). Remember, route distance (route travel time) between two points is 
the distance (travel time) along the current trajectory, in the forward (increasing time) 
direction. 

2A: the condition is satisfied for an object o if every point on o's route between 
starttime and endtime is within travel-time t or travel-distance s from R. Here distance 
means distance along the shortest path. 

IB: the condition is satisfied for o if some point on o's trajectory between starttime 
and endtime is within route-distance s or route-travel-time t from R 

2B: the condition is satisfied for o if some point on o's trajectory between starttime 
and endtime is within distance s or travel-time t from R. Here distance means distance 
along the shortest path. 

2.2 Spatiotemporal Index 

We adopt a 3D spatiotemporal indexing scheme similar to the ones proposed in 
[ 1 8] [ 10] [ 16] . The 2D space in which the object moves, and the time dimension form a 
3D space. We are interested in subsets of this domain depending on the quantifiers of 
the WITHIN predicate. We assume the existence of a 3D index in the database 
management system. 

Now we consider the insertion of a trajectory into the database index. To do so we 
enclose each 3D straight line segment into the minimum axes-parallel 3D rectangle 
that contains the segment. This is the Minimum Bounding Rectangle (MBR) of the 
trajectory segment. Observe that the trajectory segment is a diagonal of its MBR. 
Each line segment of a trajectory, its sequence number in the trajectory, its MBR, and 
the object id are inserted as a record in the 3D index. 

In this paper we establish the mapping of the query to a 3D query window Q, such 
that we retrieve all the trajectories that intersect Q. This is the only retrieval from the 
hard disk assuming that the contents of Q are a small part of the database and can fit 
into main memory. Then further processing of the query is carried out in memory. 

Indeed, trajectories in real-life applications like the ones we consider here contain a 
small number of line segments. For instance if a typical trajectory is 10km long and 
each city block is about 0.5 km long, then the trajectory consists of nodes. 
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For Boolean combinations of spatial/temporal predicates (i.e., combination of 
WITHIN predicates) always the resulting set of trajectories will be the intersection (for 
conjunctive queries) or the union (for disjunctive ones) of the respective query spaces. 
Hence the spatiotemporal indexing scheme suffices for Boolean combination of queries. 



3 Implementation Design of the Predicates in a Spatial Database 
Environment 

Assuming that the data types needed are implemented in an OR-DBMSs [1 1, 9, 4] we 
need to implement functions/operators that represent the functionality of these data 
types. These functions are implemented as user defined functions (UDFs) [4], which 
implement operators, or methods, that can be applied to user defined data types. In the 
following we review how commercial DBMSs handle the issues of definition, 
implementation and retrieval of user defined functions. DB2 Spatial Extender in- 
cludes pre-defined functions for operations such as spatial calculations (for example, 
the distance between two points), comparisons (the customers located within a five- 
mile radius of a store), data exchange, and others. To achieve better performance, 
Spatial Extender functions run in an “unfenced” mode in the same address space as 
the database server itself. To support the Spatial Extender, IBM has also extended the 
notion of UDFs to allow them to be used as user-defined predicates. When a query 
includes, in a “where” clause, a UDF that is also defined as a predicate, the optimizer 
knows to look for an index on the column that would help in evaluating the predicate. 
User-defined predicates can be used, for example, to compare two spatial objects, 
such as two polygons or a point and a polygon, to see if they overlap or if one is 
contained within another. As predicates, UDFs trigger the use of indexes. In the case 
of Informix [9] user-defined routines can be written using Informix’s Stored Proce- 
dure Language (SPL), or third-generation languages such as C, C++, or Java. SPL 
routines contain SQL statements that are parsed, optimized, and stored in the system 
catalog tables in executable format, making it ideal for SQL-intensive tasks. Routines 
written in third-generation languages are compiled and loaded into a shared object file 
or dynamic link library (DLL). The routine and name of the shared object are declared 
to the server and once invoked, the shared object is linked to the server. Since C, C++, 
and Java are powerful, full-function development languages, routines written in these 
languages can carry out much more complicated computations than SPL functions. 

Commercial DBMSs provide support for alternative indexing methods in order to 
speed up queries involving user defined operators. In the case of IBM DB2, the DB2 
index manager is open so that new index types — called index extensions — can be 
defined. This includes the type of index, which data types it works with, how index 
entries are generated from column values, and how the index can be exploited to 
evaluate queries efficiently. All of these index extensions are implemented through 
SQL extensions; this high-level API makes it relatively easy for the developer to 
integrate complex, user- defined index functionality. Especially for spatial data these 
extensions were used to implement support for spatial indexes, called grids, in the 
Spatial Extender. 

In the context of Informix, developers can index new data types using existing 
access methods, or add new access methods of their own. Two access methods are 
supported: B-tree index and R-tree index. A B-tree index organizes index information 
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and is arranged as a hierarchy of pages. The Universal Data Option introduces the R- 
tree index, optimized for multidimensional data. 

In the following we propose an implementation design of the proposed query 
predicates in terms of functions and operators provided by commercial DBMSs. 

3.1 Query Processing Scheme for LOC and WHEN_AT Predicate 
LOC(id o, time t) 

This function gets as input an object identifier o and a time t and returns the object’s 
position at time t. The object o is retrieved using an index on the id attribute. 

If t is before the starttime (or after ending time) of the object trajectory, then the 
position returned is the start_point (or the end point) of the trajectory. 

Otherwise, we locate the two times t, t i+1 in T(o) for which it is true that t < t < t i+1 . 
Then we can compute the exact position (x,y) of the object at time t by using linear 
interpolation: 

X = x, + ((X i+1 - x)/( t i+1 - 1)) (t i+1 -t) 

y = y, + ((y i+ r y,y( t 1+ r 0) (**,-0 

Then the 2Dpoint (x,y) is the location of the object at time t. 

When_at(id o, 2D_point p) 

The algorithm retrieves the object, and performs the following steps: 

a. Check if the point belongs to the route (the route of the object is obtained by 
disregarding the time of the Trajectory attribute) of the object. For this one can 
use the system provided function intersecttposition, o. route). If it does not, then 
null is returned. 

b. If it belongs, then find the line segment(s) of the route to which point p belongs. 

c. For each one of them, compute by interpolation the time when the object was at p 

The object may pass more than once by the same point p so the result of the 
function can be a set of times. 

3.2 Query Processing Scheme for WITHIN Predicate 

In the sequel we elaborate on the query processing scheme for the predicate WITHIN. 

(I At): for option 1A (ALONG EXISTING PATH/ALWAYS BETWEEN ) assuming that 
the cost, say 5, is given in terms of travel-time: 

Then the query is formulated as follows: 

Q2: 

SELECT id FROM M_0 

WHERE id WITHIN 5 minutes FROM R 

ALONG EXISTING PATH 
ALWAYS BETWEEN starttime and 
endtime 

/* semantics: retrieve each object for which at 
every time point between starttime and endtime, 
the object will reach R within 5 minutes, while 
traveling on its trajectory. */ 
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We use a filter and refinement approach to process this query. The filter uses the 
spatiotemporal index, and it is as follows. The query window Q is a line segment in 
3D perpendicular to the plane t = starttime. It starts at the plane t = starttime and has 
length (endtime+5 starttime). Thus the line ends on the plane t = endtime+5. Using 
the spatiotemporal index we retrieve the set S of all the trajectories that intersect Q. 
These are the trajectories of objects o. that go through the point R sometime between 
starttime and endtime+5. 

The refinement step is as follows. For each trajectory T of S we consider the sub 
trajectory s between starttime and endtime. If the object is not "alive" on some 
segment of this sub-trajectory, then the trajectory is eliminated from S. 

Otherwise, let el be the time-point when T crosses R for the first time between 
starttime and endtime+5; let e2 be the time-point when T crosses R for the second 
time in the same interval; etc. 

Note that el starttime by definition. If el starttime > 5, then clearly the location 
of the object at starttime is more than 5 minutes away from R along the trajectory, 
thus T is eliminated from S. Otherwise, i.e, if 5 el starttime, then we consider 
endtime. If el endtime, this means that at endtime the object has not reached the 
point R yet; but then clearly, from its location at endtime it will reach the point within 
5 minutes, since at starttime it is not more than 5 minutes away. We conclude that the 
trajectory satisfies the condition, and the algorithm continues to examine the next 
trajectory. Otherwise, i.e. if endtime > el, we have to examine whether the subsection 
of the trajectory between el and endtime satisfies the condition with respect to e2, e3, 
e4, etc. In order to verify this we assign el to starttime and repeat the procedure 
recursively. Specifically, if e2 does not exist or if e2 starttime > 5, then clearly the 
location of the object at starttime is more than 5 minutes away from R along the 
trajectory, thus T is eliminated from S. 

Otherwise, i.e. if 5 e2 - starttime, then we consider endtime. If e2 endtime, this 
means that at endtime the object has not reached the point R (for the second time) yet; 
but then clearly it will reach R again within 5 minutes. Thus the trajectory satisfies the 
condition, and the algorithm continues to examine the next trajectory. 

Otherwise, i.e. if endtime > e2, we have to examine whether the subsection of the 
trajectory between e2 and endtime satisfies the condition with respect to e3, e4, etc. 

The trajectories that remain in S after the refinement step constitute the answer to 
the query. 

(Hit): for options IB ( ALONG EXISTING PATH / SOMETIME BETWEEN) 
assuming that the cost 5 is given in terms of travel-time: 

Then the query is formulated as follows. 

Q 3: 

SELECT id FROM M_0 

WHERE id WITHIN 5 mins FROM R ALONG EXISTING PATH 

SOMETIME BETWEEN starttime and 
endtime 

/* semantics: give me the objects that sometime 
between starttime and endtime are less than 5 
minutes from R when they travel along their 
designated trajectory */ 
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The trajectories that satisfy Q3 are the trajectories that intersect R sometime 
between starttime and endtime+5. This query is simpler than the lAt case. The filter 
step is the same, and the refinement is much simplified. Specifically, the query 
window Q is a line segment perpendicular to the plane t = starttime, and having length 
(endtime+5 starttime). Thus the line ends on the plane t = endtime+5. Using the 
spatiotemporal index we retrieve the set S of all the trajectories that intersect Q. These 
are the trajectories that cross R sometime between starttime and endtime+5. 

The refinement step is as follows. For each trajectory T of S we consider the 
trajectory segments that are "alive" between starttime and endtime, T is in the answer 
set if and only if at least one of them is within route-travel-time 5 minutes from R. 

(I As): ALONG EXISTING PATH/ ALWAYS BETWEEN ) assuming that the cost 5 is 
given in terms of distance: 

The query is formulated as follows: 

Q 4: 

SELECT id FROM M_0 

WHERE id WITHIN 5 km FROM R 

ALONG EXISTING PATH 

ALWAYS BETWEEN starttime AND endtime 

We again use a filter and refinement approach to process this query. The filter uses 
the spatiotemporal index, and it is as follows. The query window Q is a prism having 
a square G as its base on the plane t = starttime, and having height (endtime 
starttime). 

G is a square with R at its center of gravity, and with a side of length 2*5km. Using 
the spatiotemporal index we retrieve the set S of all the trajectories that intersect Q. 

All the trajectories of objects o that go through the point R in their route segment 
between LOC(o, starttime) and [LOC(o, endtime) plus route-distance 5km] are in S. 

The refinement step is as follows. For each trajectory T of S we consider the sub 
trajectory s between starttime and endtime. If the object is not "alive" on some 
segment of this sub-trajectory, then the trajectory is eliminated from S. 

Otherwise, let el be the location when T crosses R for the first time in the route 
segment between LOC(o, starttime) and [LOC(o, endtime) plus route-distance 5km]; 
let e2 be the location when T crosses R for the second time in the same interval; etc. 

Note that the time at el starttime by definition. If the route-distance between 
LOC(o, starttime) and el is more than 5km, then clearly the location of the object at 
starttime is more than 5km away from R along the trajectory, thus T is eliminated 
from S. Otherwise, i.e. if the route-distance between LOC(o, starttime) and el is not 
more than 5km, then we consider WHEN_AT(o,el). If WHEN_AT(o,el) endtime, 
this means that at endtime the object has not reached the point R yet; but then clearly, 
from its location at endtime the distance to R is not more than 5km, since at starttime 
it is not more than 5km away. We conclude that the trajectory satisfies the condition 
of the query, and the algorithm continues to examine the next trajectory. 

Otherwise, i.e. if endtime > WHEN_AT(o,el), we have to examine whether the 
sub- trajectory between el and LOC(o, endtime) satisfies the condition with respect to 
e2, e3, e4, etc. In order to verify this we assign el to starttime and repeat the proce- 
dure recursively. Specifically, if e2 does not exist or if e2 is more than 5km away 
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from LOC(o.starttime), then clearly the location of the object at starttime is more than 
5km away from R along the trajectory, thus T is eliminated from S. Otherwise, i.e. if 
e2 is not more than 5km away from LOC(o, starttime), then we consider endtime. If 
endtime <= WHEN_AT(o,e2), this means that at endtime the object has not reached 
the point R (for the second time) yet; but then clearly it will reach R again within 
5km. Thus the trajectory satisfies the condition, and the algorithm continues to exa- 
mine the next trajectory. Otherwise, i.e. if endtime > WHEN_AT(o,e2), we have to 
examine whether the sub-trajectory between e2 and endtime satisfies the condition 
with respect to e3, e4, etc. The trajectories that remain in S after the refinement step 
constitute the answer to the query. 

(IBs): ALONG EXISTING PATH/ SOMETIME BETWEEN ) assuming that the cost 5 
is given in terms of distance: 

The trajectories that satisfy this predicate are the trajectories for which R is within 
a route-distance of 5km from some point on the sub-trajectory between 
LOC(o, starttime) and [LOC(o, endtime) + route-distance 5]. 

The query window Q is a prism having a square G as its base on the plane t = 
starttime, and having height (endtime - starttime). G is a square with R at its center of 
gravity, and with a side of length 2*5km. Using the spatiotemporal index we retrieve 
the set S of all the trajectories that intersect Q. All the trajectories of objects o that go 
through the point R in their route segment between LOC(o,starttime) and 
[LOC(o,endtime) + route-distance 5] are in S. 

The refinement step is as follows. For each trajectory T of S we consider the 
trajectory segments that are "alive" between starttime and endtime. T is in the answer 
set if and only if at least one of them is within route-distance 5km from R. 

(2 As): ALONG SHORTEST PATH / ALWAYS BETWEEN, assuming that the cost 5 is 
given in terms of distance: 

Here we will use again the spatiotemporal index mentioned in previous sections. 
The query box Q must contain all the space that can be covered by an object starting 
from R moving along the shortest path in any direction between starttime and 
endtime. The query box Q will have the following dimensions: 

2D space: it will be a square centered at R having a side equal to 2*5km. (see Fig 

2 ). 

time: end time - starttime 




Fig 2. A projection of the query box Q to the x,y plane 
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It can be shown that the query box Q contains all the potential answers to the query 
2As. Let us call S the set of trajectories that intersect the query box Q . We obtain S 
after the filter step. Consider the refinement step. First we compute the set P of whole 
or partial route segments that are at distance at most 5km from R. The trajectories that 
satisfy the predicate 2As are the trajectories of S for which the following condition is 
satisfied: all the trajectory segments between starttime and endtime are "alive", and 
their projection on the (x,y) plane is contained in P. 

(2Bs): ALONG SHORTEST PATH / SOMETIMES BETWEEN, assuming that the cost 
5 is given in terms of distance: 

In this case it must be true that for at least one point in the route of the object 
between starttime and endtime, it is closer than 5km from R along the shortest path. 

Again we use the spatiotemporal index, and S as it results from the query 2As. The 
set P is also similarly defined. Finally, the trajectories that satisfy the predicate 2Bs 
are the trajectories of S for which the following condition is satisfied: some the 
trajectory segments between starttime and endtime are "alive", and their projection on 
the (x,y) plane is contained in P. 

(2At): ALONG SHORTEST PATH / ALWAYS BETWEEN, assuming that the cost 5 is 
given in terms of travel time: 

The filter step constructs the query Q as follows. First we compute the set M of 
route segments (consisting of the points) that are at travel-time at most 5 minutes 
from R. We do so by using the standard shortest path algorithm on the map graph. For 
each route segment r we construct a rectangle in 3D with base r on the plane t = 
starttime, and a height (endtime starttime) perpendicular to that plane. The query Q 
consists of all these rectangles. Then we retrieve the set S of trajectories that intersect 
this set of rectangles. 

The refinement step is as follows. For each trajectory T in S we compute the set 
T.M of route segments that the object travels between starttime and endtime. If the 
object is not alive on all these route segments, then T is eliminated from S. The 
trajectories that satisfy the query are: the subset of S consisting of each trajectory T 
for which T.M is a subset of M. 

(2Bt): ALONG SHORTEST PATH / SOMETIME BETWEEN, assuming that the cost 5 
is given in terms of travel time: 

The filter step is the same as in 2At. The refinement step is as follows. For each 
trajectory T in S we compute the set T.M of route segments that the object travels 
between starttime and endtime, and on which the object is "alive". The trajectories 
that satisfy the query are: the subset of S consisting of each trajectory T for which the 
intersection of T.M and M is nonempty. 



4 Related Work 

Temporal data models provide built-in support for capturing one or more temporal 
aspects of the database entities. It is conceptually straightforward to also associate the 
database entities with spatial values. Concrete proposals include a variant of Gadia's 
temporal model [14, Ch. 2], a derivative of this model [2], and STSQL [1]. 
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Essentially, these proposals introduce functions from the product of time and space to 
base domains, and they provide languages for querying the resulting databases. These 
proposals are orthogonal to the specifics of types and simply abstractly assume types 
of arbitrary subsets of space and time; no frameworks of spatiotemporal types are 
defined. The data model by Worboys [17] represents this approach. Here, spatial 
objects are associated with two temporal aspects, and a set of operators for querying is 
provided. However, this model does not provide an expressive type system, but 
basically proposes only a single type, termed ST-complex, with a limited set of 
operations. In addition, two papers exist that consider spatiotemporal data as a 
sequence of spatial snapshots and in this context address implementation issues 
related to the representation of discrete changes of spatial regions over time [12]. 

Reference [13] presents a model for moving objects along with a query language. 
However, the new language constructs there are purely temporal and nonstandard, and 
the model does not capture full trajectories. In contrast, the constructs in this paper are 
refinements of standard spatiotemporal range queries. 

Work in constraint databases is applicable to spatiotemporal settings, as arbitrary 
shapes in multidimensional spaces can be described. Papers that explicitly address 
spatiotemporal examples and models include [7, 3], However, this kind of work 
essentially assumes the single type "set of constraints," and is not concerned with 
types in the traditional sense. Operations for querying are basically those of relational 
algebra on infinite point sets. Recent work recognizes the need to include other 
operations, e.g., distance [7]. 

The Informix Dynamic Server with Universal Data Option covers type 
extensibility [8]. So-called DataBlade modules may be used with the system, thus 
covering new types and associated functions that may be used in columns of database 
tables. Of relevance to this paper, the Informix Geodetic DataBlade Module [8] 
covers types for time instants and intervals as well as spatial types for points, line 
segments, strings, rings, polygons, boxes, circles, ellipses, and coordinate pairs. 
Informix does not cover any integrated spatiotemporal data types. Limited spatiotem- 
poral data support may be obtained only by associating separate time and spatial 
values. The framework put forward in this paper provides a foundation allowing 
Informix or a third-party developer to develop a DataBlade that extends Informix with 
expressive and truly spatiotemporal data types. 

Since 1996, the Oracle DBMS has offered a so-called spatial data option, also termed 
a Spatial Cartridge, that allows the user to better manage spatial data [11]. Current sup- 
port encompasses geometric forms such as points and point clusters, lines and line 
strings, and polygons and complex polygons with holes. However, no spatiotemporal 
types are available in Oracle. The support offered by Oracle resembles the support 
offered by DB2's Spatial Extender [4], which offers spatial types such as point, line, and 
polygon, along with \multi-" versions of these, as well as associated functions, yielding 
several spatial ADT's. Like Oracle, spatiotemporal types are absent. 

A recent effort for a foundation of spatiotemporal data types and query language 
design appears in [5], where all the design is based on the moving point and moving 
region data types and the associated operations. 

One of the most recent efforts dealing with indexing of moving points is [10]. 
There the authors deal mainly with the problem of moving objects on a line and they 
propose a dual representation that is claimed to give logarithmic response time to 
range queries. That paper only covers indexing issues (not modeling or linguistic 
issues as we do here), and considers standard range queries. 
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In [18] the authors propose a comprehensive approach to embedding moving 
objects support in a commercial DBMS. Several issues are addressed such as: 
indexing moving objects, uncertainty regarding the position of an object, dynamic 
behavior evolving with time. However, the model and language constructs in this 
paper are new. 



5 Conclusions 

In this paper we introduced a model for moving objects on road networks. The road 
network is represented by an electronic map. The connection between the moving 
object trajectory model and the road network represents one of the innovative aspects 
of this work. We also defined a small set of expressive predicates that can effectively 
be implemented in terms of existing spatial operators within commercial 
database/GIS systems. The language we propose handles a large number of 
reasonable queries about moving objects. 

Although the model paradigm was build on GDT maps as a running example, the 
design of the predicates is applicable to other contexts in which objects move in a 
spatial grid. The design presented in the paper is directly applicable to such contexts, 
provided that the appropriate spatial grid model and associated support functions 
(such as the one for shortest path or distance computation) are provided. 

Another issue that arises is the representation of non polygonal trajectories. This 
case will be further investigated. As a first approach it can be treated in two ways: 
either approximated by polygonal lines using interpolation and or extrapolation 
techniques 

or represented by mathematical expressions - patterns- that represent trajectories 
in as a continuous function of time. 

Further work will be committed in the following directions: 

Deal with uncertainty features. This work will be based on initial work on spatial 
relationships [15]. 

Consider the dual model as the database representations scheme. 

Implementation of the predicates defined in the paper, and performance 
evaluation. We plan to have two parallel implementations based on different 
database products exploiting their respective type extension facilities. 
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Appendix A. Primitives Provided by Commercial Products 

In this section we refer to the spatial and other functions provided by commercial pro- 
ducts and are exploited in our design. The primitives indicates by * are provided by 
both Informix Spatial Datablade and ArcView. The ones without * are provided only 
by ArcView. 



Function name 


Semantics 


*contains(shape 1 ,shape2) 


returns TRUE if shape 1 contains shape2 


*intersect(shape 1 ,shape2) 


returns TRUE shape 1 intersect shape2 


along(polyline,dist) 


returns a point on a polyline, the route distance of 
which is "dist" length units away. 


selectby(shape) 


selects the features that intersect “shape” 


traveltime(polyline p). 


returns the travel time along the polyline p in time 
units 
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Although Arcview/Informix do not use the notion of time, they use the notion of 
cost. So given GDT maps, cost can be interpreted as travel-time or distance. IBM 
Spatial Extender [4] provides the following functions for spatial data manipulation: 

Comparison functions: contains, cross, disjoint, equals, intersects, overlap, touch, 
within, envelopes intersect 

Relationship functions: common point, embedded point, line cross, area intersect, 
interior intersect, etc. 

Combination functions: difference, symmetric difference, intersection, overlay, 
union, etc. 

Calculation functions: area, boundary, centroid, distance, endpoint, length, 
minimum distance, etc. 

Data-exchange functions: astext (for Open GIS well-known text representation), 
asbinary (for Open GIS well-known binary representation), asbinaryshape (for ESRI 
shape representation), etc. 

Transformation functions: buffer, locatealong, locatebetween, convexhull, etc. 
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Abstract. Like people who casually assess similarity between spatial scenes in 
their routine activities, users of pictorial databases are often interested in 
retrieving scenes that are similar to a given scene, and ranking them according 
to degrees of their match. For example, a town architect would like to query a 
database for the towns that have a landscape similar to the landscape of the site 
of a planned town. In this paper, we develop a computational model to 
determine the directional similarity between extended spatial objects, which 
forms a foundation for meaningful spatial similarity operators. The model is 
based on the direction-relation matrix. We derive how the similarity assessment 
of two direction-relation matrices corresponds to determining the least cost for 
transforming one direction-relation matrix into another. Using the 
transportation algorithm, the cost can be determined efficiently for pairs of 
arbitrary direction-relation matrices. The similarity values are evaluated 
empirically with several types of movements that create increasingly less similar 
direction relations. The tests confirm the cognitive plausibility of the similarity 
model. 



1 Introduction 

Similarity is an intuitive and subjective judgment, which displays no strict 
mathematical models (Tversky 1977). In their routine activities, people casually assess 
similarity between spatial scenes. Similarity also matters when users of spatial 
databases want to retrieve spatial scenes that resemble a sketched configuration and 
want to rank the results according to degrees of their match. Computational methods 
that match with users’ intuitive expectations are the focus of this paper. 
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A number of models and systems for spatial similarity retrieval have been already 
developed. For example. Query by Image Content (Flickner et al. 1995) allows a user 
to retrieve images from a database based on the contents of images. Graphic 
representations of images store geometric and visual attributes of objects and spatial 
relations between them. The geometric attribute of an object refers to its spatial extent, 
and visual attributes refer to color, shape, and texture (Gonzalez and Woods 1992). 
Geometric and visual attributes help in determining the presence of an object in a 
scene and spatial relations between objects distinguish relative placements of the 
objects in the embedding space. Combining object similarity and spatial relation 
similarity, one can make a query such as “Find scenes where object A and B are 
present, B is north of A, and A disjoint B.” Nabil et al. (1995) and Bruns and 
Egenhofer (1996) use spatial relations between objects for the assessment of scene 
similarity. Spatial relations are also used for similarity assessment in image databases 
(Chu et al. 1994; Del Bimbo et al. 1995; Del Bimbo and Pala 1997; Chu et al. 1998), 
multimedia databases (Al-Khatib et al. 1999; Yoshitaka and Ichikawa 1999), and 
video databases (Jiang and Elmagarmid 1998; Pissinou et al. 1998; Aslandogan and 
Yu 1999). In order to use spatial relations for similarity assessment, we need 
cognitively plausible methods to assess similarity between spatial relations. 

Cardinal directions play an important role in the specification of spatial 
configurations. For example, the query scene (Figure la) and the three scenes in the 
database (Figures lb-d) contain objects A and B whose topological relation is always 
disjoint; therefore, the three scenes are topologically equivalent to the query scene. 
However, when considering the cardinal direction as an additional search criterion, 
one can determine that Scene lb is the most similar to the query scene. Also, when 
considering directions, one can determine that Scene lc is more similar to the query 
scene than Scene Id. 






(a) 
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Fig. 1: (a) The query scene and (b)-(d) scenes 0, 1, and 2 in a database. 

Earlier models for spatial-relation similarity resorted to minimum-bounding 
rectangles (Egenhofer 1997; Papadias and Delis 1997), representative points 
(Gudivada and Raghavan 1995), or projections along the coordinate axes (Sistla et al. 
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1995) to model direction relations. These methods are crude approximations that often 
lead to incorrect directions when concave region objects are involved. The direction- 
relation (Goyal and Egenhofer, in press) overcomes these deficiencies as it avoids 
generalizations to points or approximations by rectangles. The method also extends to 
other geometric types such as lines and points (Goyal and Egenhofer 2000). This 
paper develops a computational method for assessing similarity of cardinal directions 
that are modeled with the detailed direction-relation matrix (Goyal and Egenhofer, in 
press). The better-quality direction-relation model displays some adverse properties 
that make it infeasible to apply the usual approaches for determining relation 
similarity; therefore, we introduce a new approach to determining spatial similarity 
that is based on the transportation algorithm. The method is cognitively plausible as 
well as computationally efficient. 

The remainder of this paper is structured as follows. Following a short summary of 
the direction-relation matrix (Section 2), we introduce distance measures for pairs of 
simple (Section 3) and complex (Section 4) direction-relation matrices. Section 5 
formulates the similarity assessment of two direction-relation matrices as a minimum- 
cost transformation. In Section 6 we demonstrate that the results of this similarity 
assessment method are cognitively meaningful by comparing the trends of similarity 
values obtained for moving objects with an expected baseline. Section 7 contains 
conclusions and discusses future work. 



2 Direction-Relation Matrix 

A cardinal direction is a binary relation involving a reference object A and a target 
object B, and a symbol that is a non-empty subset of {N, S, E, W, NE, SE, SW, NW, 
0}. The symbols are spatially organized according to a grid formed around the target 
object (Figure 2), creating nine direction tiles (Figure 1): north (N4), northeast (NE4), 
east (E4), southeast (SE4), south (S4), southwest (SW4), west (W4), northwest 
(NWa), and same (0 4 ). 




Fig. 2: The nine direction tiles formed around the reference object A. 

The detailed direction-relation matrix (Goyal and Egenhofer, in press) is a 3x3 
matrix that captures the neighborhood of the partition around the reference object and 
registers for each tile how much of the target object falls into it (Equation 1). The 
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elements in the direction-relation matrix have the same topological organization as the 
partitions around the reference object. 



area(NW A n B) 


area(N^ n b) 


area(NE A n B 


area (B) 


area (B) 


area (B 


areafW^ n b) 


area(0 A n b) 


area(E A n b) 


area (B) 


area (B) 


area(R) 


area(SW A n b) 


area(S A n b) 


area(SE A n B) 


area (B) 


area (B) 


area(R) 



( 1 ) 



For example, the detailed direction-relation matrix for the configuration shown in 
Figure 1 has six elements of zero, and three non-zero elements which sum up to 1.0 
(Equation 2) 



dir(A.B) 



0 

0 

_0 



0.05 0.45 
0 0.50 

o a 



( 2 ) 



A direction-relation matrix with exactly one non-zero element is called a single- 
element direction-relation matrix, and the corresponding relation is a single-element 
direction. There are nine single-element directions corresponding to nine cardinal 
directions. A direction-relation matrix with more than one non-zero element is called a 
multi-element direction-relation matrix, and its corresponding direction is a multi- 
element direction. Subsequently, we develop a method to compute distances between 
single-element directions (Section 3), followed by an extension for multi-element 
directions (Section 4). 



3 The Distance between Two Single-Element Direction-Relation 
Matrices 

As a quantitative measure for direction similarity, we introduce a distance measure 
between two cardinal directions, such that (1) a zero value implies that both directions 
are identical and (2) distance (D 0 , D x ) > distance (D 0 , D 2 ) means D 0 is more similar to 
Z) 2 than Z) 0 to D u where D 0 , D u and D 2 denote cardinal directions from it to A in 
Scenes 0, 1, and 2, respectively. A conceptual neighborhood graph for a set of 
mutually exclusive spatial relations serves as the basis for computing distances 
between the relations in this set. Conceptual neighborhood graphs have been used for 
deriving distance measures between 1 -dimensional interval relations (Freksa 1992), 
topological relations in R 2 (Egenhofer and Al-Taha 1992), and relations between 
minimum bounding rectangles (Papadias and Delis 1997). A continuously changing 
relation follows a path along the conceptual neighborhood graph. For example, if a 
target object B moves eastward from the northwest tile (Figure 3a) it cannot move 
directly to the northeast tile (Figure 3c), because it must go through the direction 
tile(s) that connect the northwest and northeast tiles. The shortest path would lead 
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through the north tile (Figure 3b), although other connected paths are possible as well 
(e.g., through the west, same, and east tiles). 




Fig. 3: The shortest path to move the target object B from the northwest tile to the northeast tile 
is through the north tile. 

In order to compute the distance between cardinal directions, we construct a 
conceptual neighborhood graph for the nine cardinal directions using the 4- 
neighborhood of the nine tiles. This graph has a vertex for each cardinal direction and 
an edge for each pair of cardinal directions that are horizontally or vertically adjacent 
(Figure 4). The distance between two cardinal directions is the length of the shortest 
path between two directions along the conceptual neighborhood graph (Figure 5). The 
distance between two identical directions is zero, which is the shortest of all distances. 
The distance between the cardinal directions northwest and southeast is four, which is 
the maximum. The only other pair with the maximum distance is northeast and 
southwest. The distance function abides by the axioms of a distance — positivity, 
symmetry, and triangle inequality. 




Fig. 4: The conceptual neighborhood graph for nine cardinal directions based on the 
4-neighborhood between tiles. 
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Fig. 5: Four-neighbor distances between cardinal directions for regions. 

For example, the distance between the directions of B with respect to A along the 
conceptual neighborhood graph in Figures 3a and b is 1, while the distance between 
the directions in Figures 3a and c is 2. Based on these distances, we infer that the 
direction of B with respect to A in Figure 3a is more similar to the direction in Figure 
3b than the direction in Figure 3a to the direction in Figure 3c. 

The distance measure between single-element direction-relation matrices serves as 
the basis for the distance between multi-element direction-relation matrices. 



4 Distance between Two Multi-Element Direction-Relation 
Matrices 

To account for distances between multi-element direction-relation matrices we extend 
the method used for computing the distance between cardinal directions from single- 
element direction-relation matrices. If the target object B moves eastward from the 
northwest tile (Figure 6a) to the northeast tile (Figure 6e), its trajectory moves 
successively over the northwest and north tiles (Figure 6b), the north tile alone 
(Figure 6c), and the north and northeast tiles (Figure 6d). The directions in Figures 6b 
and d require multi-element direction-relation matrices for their representation. 
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Fig. 6: The target object moves across single as well as multi-element cardinal directions from 
(a) northwest through (b) northwest and north, (c) north, (d) north and northeast, to (e) 
northeast. 



Definition 1: The distance between two arbitrary direction-relation matrices, D° and 
D 1 , is the minimum cost for transforming matrix D° into D ] by moving the non-zero 
elements of D° from their locations to the locations of the non-zero elements of I)' 
along the conceptual neighborhood graph. 

The cost of this transformation is the weighted sum of the distances along the 
neighborhood graph between the source and destination direction tiles, where a source 
refers to a cardinal direction from where a non-zero element is moved and a 
destination refers to a cardinal direction where the element is moved to. The weighting 
of a distance between a source and a destination is done by the element values moved 
between them. For example, transforming matrix D° (Equation 3a) into D (Equation 
3b) requires the movement of the value 0.4 from northwest to northeast, and the value 
0.6 from north to northeast. The cost of this transformation is 0.4 x distance (NW, 
NE) + 0.6 x distance (N, NE), which is 0.4 x 2 + 0.6 x 1 = 1.4. 



D 



D l = 



0.4 0.6 

0 0 



.0 
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0 
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0 

0 

0_ 
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0 

0 



(3a) 



(3b) 



The remainder of this section introduces consistency constraints and properties of 
intermediate matrices, which are needed to develop the method for distance 
computation between arbitrary direction-relation matrices. 

Definition 2: The sum of a matrix P is defined as the sum of the values of it elements 
(Equation 4). 

sum( P) := XX 

Vi V; (4) 

Definition 3: The commonality C 01 between two direction-relation matrices, D° and 
D l , is a 3x3 matrix defined as the minimum of each pair of corresponding element 
values (Equation 5). 
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Vi,;: C” 1 - min(D°,D‘) (5) 

The values of the elements in C 01 lie in the interval [0, 1]. The value of sum (C 01 ) 
also lies in the interval [0, 1]. It is 0 if all the corresponding pairs of elements have at 
least one 0. It would be 1 if the distribution of the target objects in both D° and /) 
were identical. Since the minimum of a set of numbers is unique and does not depend 
on their order, the calculation of the commonality is commutative (Equation 6). 



For example, the commonality matrices C 01 and C 10 for directions of B with respect 
to A in Scene 0 and Scene 1 (Figure 7) have the same values (Equation 7). 



Scene 0 Scene 1 



i r 
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0.06 
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D l = 


0 


0.21 


0.01 
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0 _ 




0 


0.72 


0.06 



Fig. 7: A direction comparison of Scenes 0 and 1 with identical objects, but different 

directions D° and D l , respectively. 



c 01 =c 10 



0 0 0 

0 0.06 0 
0 0 0 



(7) 



Definition 4: The asymmetric difference R m between two direction-relation matrices, 
D° and D 1 , is defined as the difference of the direction-relation matrix D° and the 
commonality (Equation 8a), and has a corresponding value for R 10 (Equation 8b). 



R 01 := D° - C 01 
R 10 := D l - C 10 



(8a) 

(8b) 



We use the term non-zero part of a matrix for a non-zero element or a fraction of a 
non-zero element. The asymmetric difference R m has the distinct non-zero parts of D° 
that are not in D l . Conversely, R u> (Equation 8b) has the distinct non-zero parts of if 
that are not in D°. For example, the asymmetric difference matrix R m (Equation 9a) 
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for the scenes in Figure 7 has no non-zero parts that are in D l , while R 10 (Equation 9b) 
has no non-zero parts that are in D°. 



R 



01 



0 0.84 0.10 
0 0 0 
0 0 0 



(9a) 



R 



10 



0 0 0 
0 0.15 0.01 
0 0.72 0.06 



(9b) 



The values of elements in R oi and A’ 10 lie in the closed interval [0, 1], The values of 
sum (R 01 ) and sum (R 10 ) also lie in the interval [0, 1], The value 0 for sum (R 10 ) means 
there is no difference between matrices D 0 and D u whereas the value 1 means there is 
no commonality between matrices D 0 and D x . 

Definition 5: The direction-difference (A 01 ) between two direction-relation matrices, 
D° and D 1 , is defined as the difference of the two relations’ asymmetric differences 
(Equation 10). 



A 01 t= R° l - R l ° (10) 

The values of the elements in A 01 lie in the interval [-1, 1]. In order to express the 
direction-difference in terms of direction-relation matrices, we substitute the value of 
R oi and R 10 (Equations 8a and b) in Equation 10, and obtain Equation 11, as 
commonalty matrices C 01 and C 10 cancel each other. For example. Equation 12 gives 
the direction-difference matrices A 01 for the scenes in Figure 7. 



oi „o 1 

A =D -D 



( 11 ) 



0 0.84 0.10 

0 -0.15 -0.01 
.0 -0.72 -0.06. 



( 12 ) 



Theorem 1: The sum of the elements in R 01 equals the sum of the elements in A' 10 . 
Proof: The sum of the elements of a detailed direction-relation matrix is 1 (Equation 
13). 

sum(D°) = sum(D l ) = 1 (13) 



The addition operation (+) follows the associative law; therefore, we can express 
the asymmetric difference matrices (Equations 8a-b) in the sum form (Equations 14a- 

b). 



sum(R 01 ) = sum(D°) - sum(C° l ) 
sum(R ) = sum(D ) - sum(C ) 



(14a) 



(14b) 
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Let us assume the value of sum (C 01 ) is x, which equals sum (C 10 ); substituting x for 
these sums in Equation 14a-b and combining them with Equation 13, we obtain 
expressions for the sum of asymmetric difference matrices in terms of x (Equations 
15a-b). 

sum(R ) = 1 - x 
sum(R ) = 1 - x 

The right hand sides of both asymmetric difference matrices’ sums are identical 
(Equations 15a-b), which proves the theorem (Equation 16). 

sum( R ) = sum{R ) (16) 



(15a) 

(15b) 



Corollary 2: The sum of the elements in a direction-difference matrix is 0. 

Proof: We can express the direction-difference matrix (Equation 10) in the sum form 
(Equation 17). Since the values of the sums of the asymmetric difference matrices’ 
elements are identical (Equation 16), their differences cancel each other (Equation 
18). 

sum ( A ) := sum(R )— sum(R ) (\1) 

sum( A 01 ) := 0 (18) 

For example, the sum of elements of A 01 (Equation 12) is zero. Non-zero elements 
of the commonality matrix C 01 capture the common non-zero parts of the matrices D° 
and D l ; therefore, the non-zero parts that correspond to non-zero parts of C 01 must not 
be moved while transforming D° into /) (Definition 1). Moving them would increase 
the cost of this transformation, such that the computed cost for this transformation 
would not be the minimum value. Only those non-zero parts of D° should be moved 
that are zero in D l , which means only non-zero elements of R m must be moved to 
obtain R w . 

Definition 6: The distance between two matrices D° and D ] is the minimum cost 
incurred in transforming R oi into R 10 . 

Theorem 3: The maximum 4-neighbor distance ( distance ^ ) between two direction- 
relation matrices is 4. 

Proof: The maximum cost is incurred when the maximum possible value of sum (R 01 ) 
is moved by the maximum possible distance. The maximum value of sum (R <>] ) is 1 
and the maximum 4-neighbor distance between two cardinal directions is 4 (Figure 5); 
therefore, the value of distance‘s is 4x1=4. 

The maximum distance between two direction-relation matrices can occur only 
between single-element direction-relation matrices that have non-zero values for the 
farthest direction tiles. For example, the value of 4-neighbor distance between two 
single-element direction-relation matrices with non-zero values in the southwest and 
northeast tiles is 4. In the direction-difference A 01 , non-zero elements that correspond 
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to non-zero elements of ft 01 are of positive polarity, while non-zero elements that 
correspond to non-zero elements of ft 10 are of negative polarity (Equation 10). The 
sum of the elements of the matrix A 01 is zero (Corollary 2). The matrix A 01 has all the 
necessary information to compute the minimum cost of transforming D° into D\ 
which is the same as the minimum cost of transforming R m into R u> . 



5 Minimum Cost Solution 



The problem of determining the minimum cost for transforming matrix R (n into 
matrix ft 10 can be formulated as a balanced transportation problem (Murty 1976; 
Strayer 1989), which is a special case of the linear programming problem (Dantzig 
1963; Dantzig and Thapa 1997). A transportation problem records the supplies of all 
the warehouses, the demands of all the markets, and the unit costs for all the pairs of 
the warehouses and the markets. This setting applies to the direction difference matrix 
A 01 , where each positive element in A 01 corresponds to a warehouse, whereas each 
negative element corresponds to a market. The supply of the z'th warehouse IT; is Si, 
which equals the magnitude of the corresponding element in A 01 . Similarly, the 
demand of the /th market Mj is dj, which also equals the magnitude of the 
corresponding element in A 01 . The cost cy for moving a unit supply from IT, to M ) is 
distance (W v Mj). The sum (R 01 ) equals the sum (ft 10 ) (Theorem 1); therefore, the sum 
of the supplies of all warehouses is the same as the sum of the demands of all markets, 
which identifies a balanced transportation problem. 

If is the number of units to be shipped from the warehouse VTj to the market Mj in 
the final solution, then the goal of the transportation problem is to determine the 
values of x such that the total cost z (Equation 19) is a minimum such that the 
warehouse and market constraints are satisfied (Equations 20a-b). 



p n 



z = ZL c a x ii 

i= 1 j= 1 


(19) 


n 


Vi 6 W : s = 7 ,x.. 

i <j 

7=1 


(20a) 



V/ e M : d. = / 






(20b) 

While each constraint can be formulated as a linear equation, the system of the 
warehouse and market constraints is underdetermined in most cases, such that the 
common algebraic methods for linear equations do not provide a unique solution. For 
the direction difference with nine elements, a total of twenty different scenarios may 
occur. In the worst case, there are 4*5=20 unknown elements with 2+4=6 warehouse 
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and market constraints (Equations 20a and b) and one additional constraint for the 
minimum cost (Equation 19). Other scenarios may be less underconstrained — the 
direction difference for the example in Figure 7 has eight unknown variables with 
seven constraints total — or potentially overconstrained as in the case of a direction 
difference with 1 warehouse and n markets. To accommodate for these variations, 
typically the transportation problem is solved in two phases: (1) finding a basic 
feasible solution and (2) improving the basic feasible solution iteratively, until an 
optimal solution is obtained. 

A basic feasible solution is a set of .r values that satisfy the market and warehouse 
constraints, but it may not give the minimum value of z. There can be more than one 
set of x values that yield the same minimum value z. A common algorithm to 
determine a basic feasible solution is the northwest 1 corner method (Strayer 1989, p. 
180), while the minimum can be obtained through the transportation algorithm 
(Strayer 1989, p. 153). Since both algorithms are classic, we do not repeat them here 
but refer the interested reader either to a textbook (Strayer 1989) or the details of their 
use for calculating direction similarity (Goyal 2000, pp. 100-108). 

The example in Figure 7 formulates 2+4=6 warehouse and market constraints 
(Equations 21a-f) and the cost function (Equation 21g). 



X[[+ X^ 2 +X^ 3 +X^ 4 — 0.84 


(21a) 


*21+*22+*23+-*24 = 0.10 


(21b) 


+n +-Di=0.06 


(21c) 


*12+*22 = 0.72 


(2 Id) 


-V| 3 ++23 = 0.0 1 


(21e) 


*14+*24=0.15 


(2 1 f) 


~= 3.\' i + 2 x,2+2X| 3+X| 4+2X2] +3x22^”+23”^7x24 


(2 1 g) 



The northwest corner method offers then a basic feasible solution at z fe asibie= 1-89 
(Equation 22a), for which the transportation algorithm finds a minimum at 
Zminim,im=1.75 (Equation 22b). 

"=3*0.06+2*0.72+2*0.01 + 1*0.05+2*0.10 (22a) 

"=2*0.72+1*0.12+2*0.06+1*0.01+2*0.03 (22b) 

Although the simplex method has a non-polynomial upper bound, it requires an 
average of 3 m pivot steps, where m is the number of constraints in the problem. For 
the 3x direction-relation matrix the largest value for m is 9, therefore, the simplex 



1 The term northwest corner refers here to the top-left cost value, that is, the first row and first column in 
the transportation tableau used to solve the transportation problem. It should not be confused with the 
northwest tile in the direction-relation matrix. 
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method can be performed in at most 27 pivot steps. In comparison, relying on a graph 
of all possible movements could result in a graph with as many as 17,250,360 paths 
(Goyal 2000). Therefore, the method based on the transportation algorithm is 
significantly more efficient. 

Definition 7: The dissimilarity of two pairs of cardinal directions expressed in the 
form of two direction-relation matrices D° and D l , corresponds to the minimum cost 
of transforming D° into /)', normalized by the highest possible cost (Equation 23). 



^i N minCost(D°,El) 
dissimuanty(D ,D ) := j 

Definition 8: The similarity s of two pairs of cardinal directions is the complement of 
the dissimilarity (Equation 24). 

s(D , I) ) := 1 - dissimilarity! D J) ) (2A) 

Applying the similarity assessment method to the query scene in Figure la and 
Scenes 0-2 in the database (Figures 5. lb-d), the similarities values are s(D q , D°) = 
0.92, s(D q , D l ) = 0.56, and s(D'\ D 2 ) = 0.04. From these values, we can infer that the 
direction in the query scene is most similar to the direction in Scene 0, followed by 
Scene 1 , and Scene 2, as the similarity is decreasing in this order. 



6 Evaluation 

The subsequent benchmark consists of a set of successive changes in the geometry of 
the involved objects, making deformations that create an increasingly less similar 
situation. If the similarity values capture appropriately the decrease in similarity, then 
these results provide an indication that the similarity values are meaningful. 

For a systematic evaluation of the method for similarity assessment, we start with a 
query scene containing a pair of reference and target objects, and generate scenes by 
gradually changing the geometry of the target object. Throughout these changes, we 
capture the direction similarity between the query scene and the test cases and 
examine the trend of the similarity values. A cognitively plausible similarity 
assessment method is expected to give the highest similarity value (100) to the scene 
without any change and continuously decreasing values of similarity with increasing 
changes. 



6.1 Moving the Target Object over the Reference Object 

In this test, the target object B is moved iteratively from the northeast of A to the 
southwest of A. The expected graph is a straight line from the top left (100) to the 
bottom right (0). The eight different states and their corresponding similarity values 
with respect to the query scene (Figure 8) lead to a strictly monotonic decrease in 
similarity (Table la). The minor deviations from the expected graph are due to 
varying increments at which the snapshots were taken. 




Similarity of Cardinal Directions 49 




Similarity =0.43 



Similarity=0.2 



Similarity=0.12 



Similarity=0 



Fig. 8: Detailed diagonal movement of a target object, together with the corresponding 

similarity values. 

Two similar assessments confirm this pattern and provide additional insights about 
the properties of the direction-similarity assessment. Figures 9 and 10 show two 
diagonal movements of objects of different sizes — B larger than A, and A larger than 
B, respectively — recorded at coarser levels of detail than in Figure 8. The resulting 
graphs (Table lb and lc) have the same strictly monotonic decline; therefore, the 
similarity assessment shows the same trend for movement, independent of the objects’ 
sizes as well as the increments at which they are recorded. 
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Fig. 9: Coarse diagonal movement of a larger target object, together with the corresponding 

similarity values for direction relations. 
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Fig. 10: Coarse diagonal movement of a smaller target object, together with the corresponding 
similarity values for direction relations. 
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6.2 Rotating the Target Object around the Reference Object 

In the second set of tests, the target object B located in the northwest tile is rotated 
around the reference object A without intersecting with A. The expected pattern for 
this full-circle rotation is a symmetric v-shaped graph, where the highest dissimilarity 
is obtained when the target object has been rotated by 180°. 

The 17 states with their corresponding similarity values (Figure 9) lead to a strictly 
monotonic decrease in similarity, followed by a strictly monotonic increase 
(Table Id). This pattern also confirms that the same trend would be observed if the 
orientation of the rotation changed from clockwise to counterclockwise. 
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Fig. 11: Clockwise, full-circle rotation of the target object around the reference object, together 
with the corresponding similarity values for direction relations. 

Two variations of this test confirm the general trend. The sequences in Figure 12 
show a half-cycle rotation where A and B overlap partially. The expected graph is a 
monotonically falling line, without reaching 0 at the end, because the query and the 
last scene differ in less than the maximum distance. Table le confirms this 
expectation. 
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Fig. 12: Clockwise, semi-circle rotation of a target object around a reference object, together 
with the corresponding similarity values for direction relations 

Another type of rotation is examined by spinning an elongated object B around an 
axis such that some of B always remains to the northwest of the reference object A, 
while parts of B move across A (Figure 13). The expected similarity profile is the 
characteristic v-shape of a full-circle rotation (Table Id) with a low point around 50%, 
however, because B does not make a full-circle transition to the opposite end of A’s 
northwest corner. The graph (Table If) confirms this behavior of the similarity 
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Fig. 13: Clockwise 360° rotation of a target object, together with the corresponding similarity 
values for direction relations 
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6.3 Scaling Up the Target Object 



A third set of test cases analyzes the effects of scaling up the reference target B such 
the B’s direction with respect to A is affected. First B is in the northwest tile and 
grows continuously (Figures 14), and then the same process is repeated with B starting 
due north of A (Figure 15). In both cases the anticipated changes in direction are 
mono tonic, which are confirmed through the similarity graphs (Table lg and h). 
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Fig. 14: Scaling up a target object located in the northwest tile of the reference object 
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Fig. 15: Scaling up a target object located in the north tile of the reference object. 

Table 1. The direction profiles of the movements for diagonal movements (a-c), rotations (d-f), 

and scalings (g-h). 






(g) 



(h) 



Similarity of Cardinal Directions 53 



7 Conclusions and Future Work 

The similarity assessment for direction-relation matrices corresponds to cognitively 
plausible cases, and provides a computationally efficient method via the transportation 
algorithm. Unlike other direction-relation representations, the direction-relation matrix 
provides an accurate representation of direction relations without resorting to such 
crude approximations as centroids or minimum-bounding rectangles. The 
computational model for assessing direction-relation similarity gave new insights for 
methods that may be useful in such settings as sketch-based retrieval of spatial 
configurations. 

While we demonstrated through commonsensical evaluations the plausibility of the 
approach, it might be of interest to investigate other methods for determining the 
distances within the conceptual neighborhood graph among tiles. Goyal (2000) 
already demonstrated that the use of the 8-neighborhood of direction tiles in lieu of the 
4-neighborhood leads to counterintuitive similarity values. Are there other 
neighborhood configurations that would give better results than the 4-neighborhood? 
Additional evaluations with human subject tests will be needed to confirm the 
concordance with people’s judgments so that meaningful spatial operators for query 
languages can be obtained. The overall method for determining similarity of direction 
relations is flexible enough to accommodate such variations in the distances between 
single-element direction relations. 

A significant side product of this method of similarity assessment is the new insight 
that sequences of direction-similarity values for continuously moving objects may be 
used as a high-level description of the objects’ movements. These sequences not only 
capture the closest relations, but also provide characteristic profiles for high-level 
spatial concepts. For example, a particular graph may link directly to a type of 
deformation. Therefore, similarity profiles of the sort presented in Table 1 may 
capture high-level types of spatial changes, which then translate into meaningful 
cognitive or linguistic spatial concepts. In particular, the combined assessment of such 
direction profiles with topological profiles (Egenhofer and Al-Taha, 1992) appears to 
be very productive. 
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Abstract. In dynamic spatio-temporal environments where objects may 
continuously move in space, maintaining consistent information about 
the location of objects and processing motion-specific queries is a chal- 
lenging problem. In this paper, we focus on indexing and query process- 
ing techniques for mobile objects. Specifically, we develop a classification 
of different types of selection queries that arise in mobile environments 
and explore efficient algorithms to evaluate them. Query processing algo- 
rithms are developed for both native space and parametric space indexing 
techniques. A performance study compares the two indexing strategies 
for different types of queries. 



1 Introduction 

Increasingly, emerging application domains require database management sys- 
tems to represent mobile objects and support querying on the motion properties 
of objects cam For example, a traffic monitoring application may require 
storage of information about mobile vehicles (e.g., buses, taxicabs, ambulances, 
automobiles). Similarly, tanks, airplanes, and military formations in a war sim- 
ulation application need to be represented. A weather simulation would need 
representation and querying over trajectories of various weather patterns, e.g., 
storm fronts. Motion properties of objects are dynamic - i.e. , the location of 
mobile objects changes continuously as time passes, without an explicit update 
to the databascQ. This is in contrast to static objects traditionally supported in 
databases whose value changes only with an explicit update. 

Motion information of objects can be represented in the database using mo- 
tion parameters and location functions which compute the position of the object 
in space at any time. Different such parameters and functions may be associated 
with different objects, based on the type of their motion. As an example, con- 
sider an object that translates linearly with constant velocity in 2-dimensional 
space. The motion parameters in this case correspond to the object’s starting 
location (x s ,y s ) at some initial time t s and its velocity (v x ,v y ) along the spatial 
dimensions. Using these parameters the location of the object at any future time 
t > t s can be determined. 

1 Other object properties besides location can also be dynamic, e.g., fuel level of a 
moving vehicle 
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Types of Queries: Similar to queries in traditional spatio-temporal data- 
bases over static objects, queries over mobile objects involve projection, selec- 
tion, join, and aggregation operations. In this paper, we focus on selection queries 
since they are the building block for more complex SQL queries. Selection queries 
may be classified as either range or nearest neighbor queries. In the first case, ob- 
jects that fall within a given spatio-temporal range are retrieved. In the second 
one, we are interested in objects that are relatively “closer” to the query point. 

The notion of proximity differs between time and the spatial dimensions. 
In the temporal sense, proximity means co-occurence within a certain time pe- 
riod. Spatially, it refers to geographical/geometrical closeness. Thus, it is highly 
desirable to separate the selection predicate into spatial and temporal compo- 
nents, each of which can be specified in a different manner, either as a k-Nearest 
Neighbor or Range predicate. 

In Fig. |Jthe types of queries that are meaningful are marked with (ff). We 
will now give examples of queries handled in our system. 





Temporal kNN 


Temporal Range 


Spatial kNN 


X 




Spatial Range 


V 


V 



Fig. 1 . Spatio-Temporal Selection Query Classification 



• A Spatio-temporal Range Query is specified by both a spatial and a 
temporal range. An example is “retrieve all vehicles that were within a mile of 
the location (spatial range) of an accident between 4~5pm (temporal range)”. 

• A Temporal kNN Query is specified with a spatial range and a nearest 
neighbor predicate on the temporal value. An example is “retrieve the first ten 
vehicles that were within 1 00 yards (spatial range) from the scene of an accident 
ordered based on the difference between the time the accident occured and the 
time the vehicle crossed the spatial region (temporal kNN predicate)”. Note that 
in a temporal kNN query, we are sometimes interested only in the most recent 
past or in upcoming future events but not in both. In such cases, the tempo- 
ral kNN query will expand the search only in one direction of the temporal 
dimension. 

• A Spatial kNN Query is specified with a temporal range and a kNN 
predicate on the spatial dimensions. For example, “retrieve the five ambulances 
that were nearest to the location of the accident between 4~5pm” . 

• A Spatio-Temporal kNN Query, marked with (x) in Fig. [Q would 
specify a kNN predicate in both the spatial and temporal dimensions. Such a 
query type is not possible since it would imply that a total order existed between 
the temporal and the spatial dimensions. 
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Contributions: We explore two approaches - native space indexing (NSI) 
and parametric space indexing (PSI) and we have developed algorithms for eval- 
uating all previously described query types, using both of these approaches. In 
the NSI approach, motion is indexed in the original space in which it occurs, 
while in the PSI approach a parametric space defined by the motion parameters 
of the objects is used. Work on indexing mobile objects has recently been stud- 
ied in 1230112; we w iU discuss the state of the art in the following section. This 
paper makes the following contribution to the existing literature. First, while 
most existing approaches have explored techniques to support spatio-temporal 
range queries over mobile objects, we also develop techniques for temporal and 
spatial kNN queries as described above, by appropriately adapting techniques 
proposed for kNN in um. Furthermore, the emphasis of existing techniques 
[lb II 2l has been on the PSI strategy. In contrast, we explore both NSI and PSI 
strategies and compare them for different types of queries. Our results actual- 
lly illustrate that NSI almost always performs better compared to PSI strate- 
gies for all query types. We provide an intuitive justification for this behav- 
ior. 



Roadmap of the Paper: The rest of the paper is organized as follows. 
Sect. Q discusses related work on indexing of motion. Sect. 0 discusses two mo- 
tion indexing techniques (native space- and parametric space- indexing) and 
corresponding query processing techniques for various query types. In Sect. 0 we 
evaluate these techniques. Sect . 0 concludes the paper. 



2 Related Work 

Techniques for indexing multidimensional data (including spatial, temporal, and 
spatio-temporal data) have been extensively studied in the literature. Example 
data structures developed include R-tree and its family mm, Quadtree m. 
and hB-tree 0. Specialized techniques to index temporal objects have also been 
proposed, including multi-version index structures, e.g., Time-Split B-tree (TSB- 
tree) 0, multi- version B-tree 0. and others, summarized in Da- While the focus 
of these data structures has been on indexing temporally changing attribute 
values, they can be adapted to index spatio-temporal objects by replacing the 
key attribute (that is indexed using B-tree) by the spatial location, which will 
be indexed by a spatial data structure. 

Existing proposals for spatio-temporal index structures have been summa- 
rized in E3- The primary difference between the multi-version index structures 
and the spatio-temporal index structures is that in a multi-version index the 
temporal dimension is differentiated from other dimensions. The splitting policy 
focuses on temporal split with the objective to keep the “live” data which is cur- 
rently active clustered together. In contrast, spatio-temporal index structures do 
not differentiate between the spatial and temporal dimensions. 

The focus of the above described spatial, temporal, and spatio-temporal data 
structures has been on representing static objects - ones whose value changes 
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only with an explicit update to the data structure. Some recent papers have 
explored indexing issues for dynamic properties - that is, properties whose value 
changes continuously as time passes without an explicit update. Specifically, re- 
search has focussed on indexing and query processing over mobile data PSEC2S. 
Most of this work has concentrated on techniques to support spatio-temporal 
range queries based on current motion properties of objects projected onto a 
time in the future. For example, in m, Quadtree is used to partition the em- 
bedded space in which object motion occurs. Mobile objects are represented 
by clipping their motion trajectories into the collection of leaf cells that they 
intersect. Standard techniques are then used to support spatio-temporal range 
queries. 

In 0 , motion is represented in a parametric space using the velocity of the 
object and its projected location along each dimension at a global time reference. 
Given a parametric representation of motion, a standard multidimensional index 
structure (specifically, kDB-tree jllj) is used with dimensions corresponding to 
velocity and location. The disadvantage of indexing positions projected at a 
global time reference is a loss of locality. By this, we mean that the relative 
placement of objects’ projections at a global time reference does not carry much 
information as to their location at the present or future times. The authors also 
suggest an improvement to their approach by transforming and projecting 2-d 
parametric space (i.e. , velocity and location dimensions) onto 1-d space and use a 
B-tree to index 1-d data. This technique is, however, applicable only to motion 
in 1-d space unless multiple B-trees are used (one per spatial dimension) for 
multidimensional motion. Furthermore, the projection of 2-d parametric space 
corresponding to the velocity and location to a single dimension also sacrifices 
the accuracy of the answer. 

In jl2j . similar to 0, motion is represented in a parametric space defined 
by its velocity and projected location along each spatial dimension at a global 
time reference. The parametric space is then indexed by a new index structure 
referred to as the TPR-tree (Time-Parameterized R-tree). TPR-tree is simi- 
lar to R*-tree [2J except that TPR-tree does not represent a node’s boundary 
with a bounding box. Rather, a node boundary of TPR-tree corresponds to an 
open-ended trapezoidal shape in the native space where the parallel sides are 
perpendicular to the temporal axis. The splitting algorithm splits an overflow 
node into two nodes whose corresponding trapezoids are minimized in size (or 
volume). The search algorithm for a range query also performs computation on 
the native space by checking the overlap between the range of the query and 
the trapezoid representation of the node. The TPR-tree uses an optimization 
that attempts to compact the spatial coverage of the leaf nodes (resulting in 
improvement of query performance). However, the optimization can be used if 
the data structure is used to answer only future queries. 
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3 Indexing and Query Processing of Motion 

3.1 Model and Assumptions 

We represent motion of objects using motion parameters and location functions. 
A location function is a generic method by which the position of an object at 
any valid time can be deduced and it corresponds to a type of motion, e.g., 
linear translation. Motion parameters specify the instantiation of the location 
function, e.g., a starting location and speed of motion in the linear translation 
example introduced in Sect, d Associated with each object’s motion is a valid 
time interval t = [b, ty,] when the motion is valid. The location functions can be 
used to compute the position of the object at any time t' Gt in a c?-dimensional 
space. In this paper, we are primarily concerned with object trajectories that 
correspond to linear motion with constant speed. The following location function 
determines the location of an object O at time t': 

O.Xi(t') = O.Xi + O.Vi(t' — 0.t\) where i = 1,2, ..., d and t! G [0.t\, 0.t\f (1) 

where O.Xi is the location and O.Vi is the velocity of the object along the i th 
dimension respectively at the starting time 0.t\, for all i = 1. . . . , d. 

As time progresses, the motion of an object may deviate from its representa- 
tion in the database (as the vector of its velocity changes). When the deviation 
exceeds a threshold, an object (or sensors tracking the object) updates its mo- 
tion parameters and/or the location function stored in the database to reflect 
its most current motion information. Each update contains the location O.x i...d, 
the velocity O.v i...d, the time of the update 0.t\ and the time of the next update 
(expiration time of this update), O.t h. 

The expiration time can be viewed as a promise to issue an update no later 
than O.th- If an update is issued at time f new < O.th, then the new motion 
information takes precedence over the previous one for the interval [t n ew>0-th] 
in which they are both valid. In practical terms, this means either that the new 
update replaces the old one (no historical information is maintained), or that 
the two co-exist and thus for a given time, the object might be represented (and 
thus retrieved) by more than one database object. Thus, it would be necessary to 
process objects after retrieval to discard duplicates by retaining the most recent 
update. 

Note that there is a tradeoff between the cost of update (which depends upon 
the update frequency) and the precision of information captured by the database. 
The update management issues including the above tradeoff have been discussed 
previously in m If the imprecision of an object’s representation is taken into 
account, its location at a given instance of time, instead of corresponding to a 
point in space (as suggested by the location function above), will rather be a 
bounded region that represents the potential location of an object. For simplicity 
of exposition, we will assume that the parameters used to capture the motion 
of an object in the database precisely determine its location until the next up- 
date of the object’s motion. The mechanisms for indexing and query processing 
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proposed can be extended straightforwardly to the case when the imprecision in 
the location of an object is taken into account (that is, the location of an object 
at each instance of time, instead of corresponding to a point, corresponds to a 
bounded region). 

To evaluate the spatio-temporal selection queries, we do not wish to scan 
through the whole database to check the query predicate for each object. In- 
stead, for efficiency, we index these objects in a multidimensional data structure 
and utilize the data structure to identify the objects that satisfy the query pred- 
icate. There are many ways to index the moving objects. In general, we can 
classify indexing mechanisms into two types: Native Space Indexing (NSI) 
and Parametric Space Indexing (PSI) which are discussed next. In the dis- 
cussion, the following notation will be used: 

Definition 1 (Interval). I = [l, h\ is a range of values from l to h. If l > h, I 
is an empty interval (0). A single value v is equivalent to [v,v]. Operations on 
intervals are intersection (C\) and overlap (§). Let J = [Ji, Jh] and K = [K\, K\f\. 
JnK = [max( Ji, Kf), min( Jh, -Kh)]- J Q K returns the boolean value of J D K ^ 
0 . 



Definition 2 (Box). OB = (Ji,/ 2 , . . . , /„) = (I i... n ) is an n-dimensional box 
covering the region of space I\ x I 2 x . . . x I. n C 3? d . A box OB is an empty box 
(OB = 0 4=> 3 i : Ii = 0). An n-dimensional point p = (vi... n ) is equivalent to 
box ([ui,i>i], [^ 2 ,^ 2 ], . . . , [v n ,v n ]). Operations on boxes are the same as those on 
intervals. Their meanings are intuitive and similar to those of intervals. 

3.2 Native Space Indexing Technique (NSI) 

Consider an object O whose motion is captured by a location function 
O.Xi^dft)- Let the valid time for the motion of the object be O.t. In 
NSI, the motion of O is represented with a bounding box with extents 
[min tg0 j O.Xift), max (g0 j O.Xift)] along each spatial dimension i, and extent 
O.t, along the temporal dimension. A multidimensional data structure (e.g., R- 
tree) is used to index the bounding box representation of motion. Note that the 
index will contain multiple bounding boxes to represent the motion trajectory of 
an object, one per motion update. We next discuss how different types of queries 
are evaluated in the NSI strategy. While the algorithms discussed below are ap- 
plicable for any multidimensional data structure, in the following discussion, we 
will assume that an R.-tree is used to index motion. 

Spatio-Temporal Range Queries: Evaluating range queries in NSI is 
straightforward. The standard R.-tree range search algorithm is used to traverse 
the data structure for objects whose motion trajectory overlaps with the query. 
The search begins from the root and all children whose bounding boxes overlap 
with the query region are traversed. The bounding boxes stored in the leaf node 
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represent objects’ boundaries. If these bounding boxes overlap with the query, 
the corresponding object is retrieved. 

Note that it is quite possible that even though an object’s motion trajectory 
does not overlap with the query, the bounding box corresponding to motion 
does overlap, resulting in false admissions. The cost of such false admissions is 
quite significant since each such admission may result in one extra disk access 
to retrieve an object that will immediately be discarded. 

The problem of false admissions in the NSI strategy can be eliminated by 
representing motion as a line segment instead of a bounding box at the leaf 
level of the R-tree. The representation of a line segment is similar to that of a 
bounding box. That is, a bounding box is represented by its lower left corner 
point and its upper right corner point while a line segment is represented by its 
starting point and its ending point. Detecting an overlap between a query and 
a line segment object is more complicated than detecting an overlap between a 
query and a bounding box object. 

To detect if there is an overlap between query’s bounding box ( Q ) and the 
line segment corresponding to the motion of an object (O) we compute the time 
interval that the motion trajectory overlaps with the query bounding box along 
each spatial dimension i separately and then intersect the computed intervals. 
If the intersection is empty, then the motion does not overlap with the query’s 
bounding box and thus the query can ignore that motion. Otherwise, the object 
is returned as an answer. 

Formally, let Tq,o be the time interval that the motion overlaps with the 
query’s bounding box and Tq,cm be the time interval that the motion overlaps 
the query’s bounding box along dimension i. We compute Tq,o,z as follows. Con- 
sidering only one spatial dimension with the temporal dimension, as shown in 
Fig. El we extend the line segment corresponding to the motion at both ends 
infinitely. Next, we compute the time fj jS and which this infinite line cuts 
the lower border (i.e., the line equation Xi = Q.xn) and upper border (i.e., the 
line equation a = Q-^ih) of the query’s bounding box along spatial dimen- 
sion i. Then, we compute Xqo ?; as the intersection of the query’s time interval 
([Q-t s ,Q-te]), the motion’s time interval ([0.f s , 0.t e }) and That is, 

t Q,o = nf =1 TQ,o,i where T Q)CM = **,e] H [0.t s , 0.t e \ C [Q.t h Q.t h } (2) 

where d is the number of spatial dimensions. In case the motion along spatial 
dimension i is parallel to the temporal axis (when there is no movement along 
that spatial axis), the motion intersects with the query if the motion is in between 
the lower and the upper borders of the query’s bounding box. Notice that the 
motion can never be perpendicular to the temporal axis since an object cannot 
be at multiple locations at the same time. The complexity of this detection is 
0(d ), where d is the number of the spatial dimensions, which is the same as the 
complexity of detecting the overlap of two bounding boxes. 



66 



K. Porkaew, I. Lazaridis, and S. Mehrotra 




Fig. 2. Temporal Intersection of Object Motion and Query Bounding Box 



We compute t ls and t i e by finding the intersection of the upper bound Q.x a 
(and lower bound Q.Xa) of the bounding box query and the line that connects 
between ( 0.t s ,0.Xi S ) and ( 0.t e ,0.Xi e ) as follows: 
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Temporal kNN Queries: A temporal kNN query is defined by a spatial range 
(IQ.xa, Q-Xih]) that all answers must satisfy, and a temporal value (Q.t) where 
all answers are returned based on the difference between Q.t and the time the 
object is in the spatial region. A temporal kNN query in the native space can 
be processed as follows. 

Similar to the range query, for each node that the search algorithm encoun- 
ters, we check if there is an intersection between the query’s spatial bounding 
box and the R-tree nodes’ spatial bounding box as in ( 0 . 

{R-x i, . . . , R.x d ) 0 (Q.x i, . . . , Q.x d ) (5) 

If ( 0 ) is false, we ignore that node. Otherwise, we compute how close the node’s 
time interval is to the query’s time so that we know the order that we want 
to visit that node. We measure the closeness with t v i s j t which is computed as 
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follows. 



) R.t\ — Q.t if Q.t < R.t\ 

Q.t - R.t h if Q.t > R.t h (6) 

0 otherwise 

Nodes are visited in ascending order of f v isit ■ For this purpose, similar to IQ. 
a priority queue of nodes sorted based on their t v ; s i t is used. That is, for each 
node encountered, if its corresponding bounding box overlaps spatially with the 
query’s spatial range, we compute its i v isit and insert this t v i s it along with the 
node id in the priority queue. Then, we pop a node from the priority queue, 
load it from the disk, and examine its children’s bounding box. This process 
continues until an object is popped from the queue and returned as the nearest 
answer so far. The user may request for the next nearest answer by continuing 
popping a node from the queue and examine bounding boxes of the children 
of the node until it pops an object id. Leaf nodes are handled differently from 
internal nodes, since we use the segment representation optimization instead of 
the bounding box, as we described earlier in the context of range search. We 
compute the time interval that the motion overlaps with the query’s bounding 
box (Tq, 0 ) as follows. 

Tq,o = where Tq,cm = [*», s , **,e] D [0.t s ,0.t e ] (7) 

We compute t i s and e just the same as m and ©■ If Tq q results in 0, the 
object is ignored. Otherwise, we compute f visit, for the object as follows. 

! 0.t s — Q.t if Q.t < 0.t s 
Q.t - 0.t e if Q.t > 0.t e (8) 

0 otherwise 

This t visit is then inserted to the priority queue along with the object id. Note 
that, in case of a one-sided kNN query - e.g., the most recent or the nearest 
future, © is still applicable except that we will not visit the node (or the object) 
for which Q.t < R.t\ (or Q.t < 0.t s ) in case of a “most recent” query. Similarity, 
we will not visit the node (or the object) for which Q.t > R.th (or Q.t > 0.t e ) 
in case of a “nearest future” query. 

Spatial kNN Queries: Recall that a spatial kNN query is defined by a tem- 
poral range ([Q.t\, Q.fh]), where all answers must satisfy, and a spatial location 
( Q.Xi ) where all answers are returned based on how close its location ( O.Xi ) is to 
Q.Xi. The search algorithm for a spatial kNN query starts from the root node of 
the tree and visit nodes whose bounding box R satisfies the condition: R.t 0 Q.t. 
If this condition is false, the node is ignored. Otherwise, we calculate the order 
that each node should be visited. We visit each node based on the ascending 
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order of s v i s it which is computed as follows: 
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That is, s v isit is the shortest distance from a box to the query point. For the 
leaf nodes where the actual motion are stored, we compute the shortest distance 
(svisit) from the line segment corresponding to the motion of the object to the 
query point as follows. First, we calculate the time interval ([t s ,f e ]) that the 
motion intersect the query along the temporal axis. Next we interpolate the 
location of object at time t s and t e as S and E respectively. That is, 

[t s , t e ] = [0.t s , 0.t e ] 0 [Q.t s , Q.t e \ (10) 

S.Xi = O.X is + (O.Xie - O.X is ) ^ 0 '^ (11) 

E.Xi = O.Xis + ( O.Xie - O.Xis) 0 Q S t ( 12 ) 

Let s be the spatial distance between the query point and 5; e be the spatial 
distance between the query point and E; and h be the spatial distance that the 
object travels from S to E. The shortest distance from the query to the object 
may be s, e, or neither (i.e., to) as shown in Fig. El Using Pythagoras’ theorem 
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Fig. 3. Computing the shortest distance from a query to each motion 



the shortest distance can be computed as follows: if h 2 < |s 2 — e 2 |, the shortest 
distance is min(s, e). Otherwise, the shortest distance is to = ^ e 2 — (- 2 ~^ +e2 ) 2 
Once we get the shortest distance between the motion and the query point during 
the query’s time interval, we insert this distance along with the motion id into 
the priority queue. 
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Indexing Other Motion Types: The NSI strategy can be extended in a 
straightforward fashion to support objects with motion other than linear trans- 
lation by representing their motion trajectories using a bounding box that covers 
the trajectory regardless of its shape. However, as discussed earlier, the bounding 
box representation will result in false admissions. The problem of false positives 
can be alleviated in one of the following two ways. First, (similar to the ap- 
proach discussed above for linear translation), object motion at the leaf level 
can be represented using a higher order curve (based on the location function 
associated with the motion) . For each motion type we will have to develop tech- 
niques to determine temporal intercepts between the motion representation and 
the query along a spatial dimension (to evaluate spatio-temporal range query) 
and techniques to evaluate t v i s ; t and s v j s i t to evaluate kNN queries. 

An alternative approach to alleviate the false admission problem might be 
to use multiple smaller bounding boxes to cover the trajectory instead of a 
single large bounding box. The approach remains correct as long as the set of 
bounding boxes together cover the entire motion trajectory. Using multiple small 
boxes has an advantage that it reduces the number of false admissions since it 
represents more compact envelopes of the trajectory. Furthermore, it potentially 
reduces the overlap between the index nodes resulting in better performance. 
Nevertheless, it increases the size of the index which could result in the increase 
in the height of the tree causing increased I/O. Details of how motion trajectory 
can be segmented into bounding boxes and the resulting tradeoff can be found 
in 0. 

3.3 Parametric Space Indexing Technique (PSI) 

Unlike NSI strategy in which motion is represented in the native space, in the 
PSI strategy, motion is represented in the parametric space consisting of motion 
parameters. The PSI strategy works as follows. Let the motion parameters be 
O. O.v and [0.t s ,0.t e \ where x i and Vi are the location and velocity 
of object O in the * th spatial dimension and [0.t Sl 0.t e \ represents the valid 
time interval of the motion. In the PSI approach, an R-tree is used to index 
a (2 d + l)-dimensional parametric space where the dimensions correspond to 
Xi...d, tq...d and t. Each motion requires storage of (2d + 2) floating-point values 
since each motion stores a temporal range [f s ,f e ] that indicates the valid time 
interval of the motion. The motion representation is interpreted as a line segment 
in the parametric space parallel to the time axis connecting points (O.x i...d, 
O.v i...d, 0.t s ) and (O.aq.d, O.v i...d, 0.t e ). Each bounding box in the parametric 
space requires a storage of 2(2d+l) floating-point values corresponding to ranges 
in each of the parametric dimensions - aq.d, iq...d and t. 

Our parametric space representation differs from the ones in Em in that 
we store the temporal range in which the motion is valid and we do not project 
the motion back to a global time reference. This overcomes the limitation of 
PSI strategies based on storing motion relative to the global time reference, as 
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we discussed in Sect. 2. An additional benefit of storing the temporal range 
explicitly is that now historical queries can easily be supported, provided that 
historical data are not deleted from the index. 

While parametric representation of motion is straightforward, to evaluate 
a query either the condition specified in the query (which is a condition over 
the native space) needs to be transformed into an equivalent condition over the 
parametric space, or alternatively, the bounding box corresponding to a node in 
parametric space needs to be mapped to a corresponding region in the native 
space. 

Spatio-Temporal Range Queries: Let (Q.xi , ..., Q.x dl Q-t) be the range 
query. Given the parametric space representation of moving objects, we want to 
be able to answer spatio-temporal range queries efficiently. Identical to searching 
in the native space, the search over the parametric index begins at the root of 
the index structure and nodes in the index structure are recursively examined. 
Let R be a node in the tree and the parametric subspace corresponding to R 
be a bounding box (xi, . . . ,Xd,v i, . . . ,v d ,t)- Node R is visited by the search 
algorithm if there may exist an object O in the parametric subspace indexed 
by R such that the location of O during the time interval t satisfies the spatial 
constraint of query Q. That is, 

3 1 {t £ ( Q.t n R.t) A (R.Xi(t ), ..., R.X d (t)) £ (Q.x i, ..., Q.x d )} (13) 

To check if there exists such an object O inside R , we need to compute 
the time interval that the bounding box query may overlap with any motion 
(projected in the native space) that has parameters within the range of R. We 
can calculate such a time interval along each spatial dimension and intersect 
them as follows. 



Tq.r = n? = iT Q , R)i (14) 

where the time interval Tqr is the time interval that Q overlaps R and Tq iRi j is 
the time interval that Q overlaps R along dimension i. Tq R , can be calculated 
as Tq k , = [min(f'), max(f')] where t ' satisfies the following equation. 

Q.xa < Xi + Vi ■ ( t ' - t) < Q.x ih (15) 

subject to the following constraints: (1) [ t , Xj, x,] £ R (the constraint of the node’s 
bounding box), (2) t' £ [R.ti,R.th] D [Q.t\,Q.th] (the answer’s constraint), and 
(3) t' > t which guarantees that the starting time of the motion, t , is before 
the time, t ', that the motion intersects the query. Note that m is a non-linear 
equation with a set of linear constraints. There is no standard technique to solve 
such a non-linear equation. Fortunately, we can simply solve Tq_ r _,; by utilizing 
the specific properties of our problem domain (i.e. , by mapping the parametric 
bounding box to a corresponding trapezoid region in the native space). There 
may be three cases of such mappings as shown in Fig. 0 In Fig. 0 (b) where 
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X (location) X (location) 






tl th T (time) 

(c) vl<=vh<=0 



Fig. 4. A projection of a parametric bounding box onto the native space 



R.Vi i < 0 < R.Vi h, the left and right borders of the trapezoid correspond to 
R.t\ and R.t^ respectively. The lower and upper bounds of the left border of the 
trapezoid correspond to R.xa and R.x^ respectively. The slope of the lower and 
the upper borders of the trapezoid correspond to R.vn and R.Vih respectively. 
That is, all motion in the bounding box starts moving from the range of location 
within [R.xn, R.Xih], moves with the range of velocity within [R.vn, R.Vih\, and 
the starting time and the ending time of each motion are within the range 
of [R.ti, R.th]. In Fig. 0 (a) where 0 < R.vn, unlike Fig. 0 (b), the slope of 
the lower border of the trapezoid is zero instead of R.va since there may be 
some motion that starts at time t during R-t^] from location R.x,\ (as 

shown with a thick arrow in the figure). Similarly, in Fig. El (c), the slope of the 
upper border of the trapezoid is zero instead of R.Vi h- We use this trapezoid 
representation of a parametric bounding box in the native space to compute the 
overlap between the query and each R-tree node as follows. First, let’s ignore 
the temporal range of the query and calculate when the trapezoid overlaps the 
spatial range of the query along each dimension. Let [fi, s ,fi ie ] be the temporal 
range that the trapezoid overlaps the spatial range of query. Then, we calculate 
TQ,R,i as follows. 

Tq,r y i = H [Q.ti, Q-th] fl [R.t\, .R.fh] (16) 



where f; iS 
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(18) 
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If ti, s > ti e or ti tS and ti, e are undefined (_L), ?Q,R,i is 0. Like range queries in 
the native space index, if Tq.r ^ 0, the search algorithm will visit R. 

For a leaf node where motions {t,x i, . . . , Xd, iq, . . . , Vd) are stored, we can 
calculate t is and as in (0 and (EJ by replacing ® an d ® 

with O.Vi. That is, 



fi,e — * 



o.t s + 


Q.Xi i - O.Xi 
O.Vi 


if O.Vi > 


o.t s + 


CJ'Xih O.Xi 


if O.Vi < 


O.Vi 


o.t s 




if O.Vi = 


_L 

O.ts + 


Q O.Xi 


otherwise 
if O.Vi > 


O.Vi 


O.ts + 


Q-xn - O.Xi 


if O.Vi < 


O.Vi 


0.t e 




if O.Vi = 


_L 




otherwise 


testing of trapezoid 


intersection 



(19) 



( 20 ) 



if ti^s > ti e or fj jS and t ie are undefined (_L), Tqrj is 0 and thus Tq,r is also 
0. If Tq r is not 0, the motion is returned as an answer. 



Temporal kNN Queries: Similar to the range query in the parametric space, 
we compute the temporal range (Pq.r) that the trapezoid corresponding to each 
node in parametric space (or the object motion stored in the leaf node) overlaps 
with the spatial range of the query by computing such a temporal range along 
each spatial dimension and intersecting them together (Tq r^) as shown in (Ell- 
Similar to the range query computation in ED- we compute Tq.r,* = [ii ;S , ti, e ] n 
(R.fi, R.th] where fj iS and f^e is the same as IT7I ) and EJ. Since 2q,r is the time 
interval that the query overlaps with each node or each motion, we want to visit 
each node and retrieve each motion in the ascending order of the time difference 
between Tq : r and Q.t. Let f v isit be the time difference which is also the order 
that we will visit each node (or motion). Let [f s ,t e ] = Tq,r, we compute f v isit as 
follows. 

f f s - Q.t if t a > Q.t 

fvisit = l Q.t- t e if Q.t > t e (21) 

I 0 otherwise. 



Spatial kNN Queries: Like spatial kNN queries in the native space, we check 
if the query overlaps with the node temporally. If not, we ignore that node. 
Let [P.fqP.fh] = [R.fpR.fh] n [Q.fpQ.fh] be the temporal overlap between the 
node and the query. If [P.fi,P.fh] = 0, the search algorithm will not visit the 
node. Otherwise, we compute the shortest distance between the query’s bounding 
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box and the node’s boundary (,s v i s itj- To compute s v uit, we first project the 
parametric bounding box during the time interval [P.ti, P.th], to a bounding box 
in the native space as follows. Let ([P.tp P.th], [P.xn, P.Xi^}) be such projection. 
We compute P.x a and P.x ih as follows. 



P.Xn 
P-Xi h 



R.Xi\ + R.vn^P.th — R.t\) 


if R.vn < 0 


(22) 


R.xn 


otherwise 


R'^ih. P R.Vi h.(P.t\\ R.t\) 


if R.Vi h > 0 


(23) 


R'^ih. 


otherwise 



That is, we project the upper border of the parametric bounding box to the 
upper border of the native bounding box P and project the lower border of the 
parametric bounding box to the lower border of the native bounding box P. 
Then we compute the shortest distance (s visit] between the query point Q.Xi the 
bounding box P as follows. 



^visit 



\ 



^2(P.Xi - Q.Xi) 2 



i = 0 



where P-Xi 



! P-Xi i if P.xn > Q.Xi 
P-Xih if P-Xih < Q-Xi (24) 
Q.Xi otherwise 



In case of the leaf node where motions are stored, we compute the shortest 
distance (s v isit) from motion O to query Q the same way that we do for the 
native space except that the native space motion is represented by the starting 
and ending point of motion while the parametric space motion is represented 
by the starting point and the velocity. Given the starting location and the time 
interval of the motion, the velocity can be derived from the ending of the motion 
and vice versa. Thus, the native space calculation for the leaf level node can be 
applied to the parametric space calculation. 



4 Empirical Evaluation 

In this section, we experimentally study the performance of NSI and PSI strate- 
gies. We compare their disk access performance in terms of the number of disk 
accesses per query as well as their CPU performance in terms of the number of 
distance computations. 

Data: We randomly generate motion for 5000 mobile objects over a 2- 
dimensional 100-by-100 grid. Each mobile object updates its motion parame- 
ters (i.e., velocity and location) approximately every 1 time unit and data is 
collected over 100 time units. This results in a total of 502504 linear motion 
segments generated. To study the impact of speed of the mobile object on the 
indexing strategy, three different data sets are generated with a varying length 
of motion segment along each spatial dimension per time unit. Data is generated 
for segment lengths corresponding to 1 (slow speed), 2 (medium speed), and 4 
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(high speed) units. Unless otherwise stated, the experiments reported are based 
on index generated for slow speed (though query performance results are similar 
for data sets corresponding to higher speed of objects). 

Index Buildup: The generated motion segments are indexed using an R-tree 
(based on both the PSI and NSI strategies). An R-tree page size is set to 4KB 
with a 50% fill-factor for all nodes. In the NSI R-tree, the fanouts are 145 and 
127 for the internal/leaf node level respectively and the height of the resulting 
tree is 3. There are 83 internal nodes and 6806 leaf nodes. In the PSI R-tree, 
the corresponding fanouts are 92 and 127. The height of the tree is 4. This is 
because a parametric bounding box requires an additional dimension to store 
velocity for each spatial axis. There are 124 internal nodes and 6151 leaf nodes. 
Note that the number of leaf nodes in the native space R-tree is surprisingly 
slightly higher (even though it has a higher fanout) compared to the number 
of leaf nodes in the parametric space R-tree. This suggests that an addtional 
dimension per spatial axis in the parametric space index does not have much 
negative impact on the parametric space index. 

Queries: To compare performance of NSI and PSI for different query types 
1000 queries of each type are generated and results averaged for each experiment. 
To study performance of range query, randomly queries of different sizes - 0.25, 
2.0, 4.0, 6.0, 8.0, and 10.0 along each spatial/teporal dimension - are generated. 
For temporal kNN queries, the temporal predicate is a single point while its 
spatial range predicate is set to a square range of size 6x6 and 10x10. For spatial 
kNN queries, the spatial predicate is a single point while its temporal range 
predicate is a random chosen range of sizes 6 and 10. 

Experiments for Range Query: Fig. 0 plots the total number of disk 
accesses (including the fraction of disk access at the leaf level) for different 
query sizes. The figure shows that the number of page I/Os increases as the 
query size increases in both NSI and PSI. NSI outperforms PSI in terms of I/O 
performance. Each histogram bar in the plot is broken into two parts: the lower 
part represents the number of leaf-level disk accesses and the upper part the 
number of non-leaf disk accesses. As shown, in both NSI and PSI, the majority 
of disk accesses occurs at the leaf level. 

Fig. 0 compares the CPU performance in terms of the number of distance 
computations. Similar to Fig.0 each histogram bar shows the number of distance 
computations at the leaf- (lower part) and non-leaf part (upper part). The results 
show that NSI performs better than PSI in terms of CPU cost. Note that each 
distance computation of PSI is more costly than that of NSI. The number of 
leaf level distance computations is an indicator of how well the index structure 
organizes the data - lower the number of distance computations, better is the 
locality preserved by the index structure. A significantly higher number of leaf 
level distance computations of PSI compared to NSI indicates that NSI preserves 
locality better compared to PSI. This result is similar to that of (Hj , in which the 
PSI/NSI problem was studied in the context of indexing of objects with spatial 
extent as points in a parametric space. 
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comparison of the number of disk access 
(overall and leaf level disk access) 



comparison of the number of distance computation 
(overall and leaf level) 



140 

120 

100 

80 

60 

40 

20 

0 



' — 


parametric space index 
native space index 








3 

Q. 

£ 

o 


10000 






parametric space inde 
native space index 


X 
























o 


8000 




















■ 




















to 


6000 












































o 


4000 


























— 






















CD 

X3 

E 


2000 


■ F 














































3 

C 


0 




F- 












































<D 


0 


2 






6 




10 



0 2 4 6 8 10 

the query size along each native dimension 



Fig. 5. I/O of range queries (NSI vs. PSI) 
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Fig. □ and Fig. 0 measure the impact of the speed of objects. Both figures 
indicate that query performance on high speed objects is worse than that of low 
speed objects. This is because high speed objects require larger boxes to cover. 
The figures also show that the impact of object’s speed in PSI is higher than 
that in NSI. This is due to the fact that greater variations of speed, lead objects 
that are relatively close in parametric space to greatly varying locations in the 
actual, i.e., native space. 

Experiments on Temporal kNN Query: Fig. 0 shows the relationship 
of the number of disk access and the number of temporal nearest neighbors for 
various spatial range of queries (i.e., 6x6 and 10x10). The result shows that, for 
each query size, the number of disk access increases fast at the begining when 
it starts exploring the tree and later the number of disk access increase slowly. 
A large query size require a higher number of disk access than a smaller query 
size in order to retrieve the same number of nearest neighbors. Similar to the 
performace of range queries, NSI appears to outperform PSI. 

Experiments on Spatial kNN Query: Fig. El shows the relationship of 
the number of disk access and the number of spatial nearest neighbors for various 
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Temporal kNN search Spatial kNN search 
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top k nearest neighbors top k nearest neighbors 

Fig. 9. I/O of temporal kNN (varying Fig. 10. I/O of spatial kNN (varying query 
query spatial ranges in NSI and PSI) temporal ranges in NSI and PSI 

temporal range of queries (i.e., 6 and 10). The result is similar to the result of 
the spatial kNN queries. 

5 Conclusion 

This paper explores effective techniques for indexing and query processing over 
spatio-temporal databases storing mobile objects. The primary difficulty in de- 
veloping query processing techniques arises from the dynamic nature of motion 
properties of objects - that is, the location of an object changes continuously 
as a function of time without an explicit update to the database. The paper ex- 
plores algorithms to evaluate three different types of selection queries- namely, 
spatio-temporal range queries and spatial and temporal nearest neighbor queries. 

The algorithms are developed under two two different representation and 
indexing techniques, native space- and parametric space- indexing (NSI/PSI). 
While NSI indexes motion in the original space in which it occurs, the PSI 
strategy indexes it in the space defined by the motion parameters. NSI preserves 
the locality of objects but also indexes “dead space” in the non-leaf levels of the 
index tree. PSI on the other hand uses the motion parameters to represent an 
object but suffers from the fact that objects that are close in parametric space 
might be somewhat divergent in real space and that objects that are far away in 
parametric space (e.g., two people starting at opposing sides of a room, moving 
with opposing velocities) may be at some time very close to each other (meeting 
in the middle of the room). Additionally, more complex types of motion (e.g., 
with acceleration) cannot be easily represented. In our future work we intend 
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to establish in more detail, both analytically and experimentally the conditions 
under which NSI/PSI should be used. 
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Abstract. This paper addresses the problem of finding k nearest neigh- 
bors for moving query point (we call it fc-NNMP). It is an important issue 
in both mobile computing research and real-life applications. The prob- 
lem assumes that the query point is not static, as in fc-nearest neighbor 
problem, but varies its position over time. In this paper, four different 
methods are proposed for solving the problem. Discussion about the pa- 
rameters affecting the performance of the algorithms is also presented. 
A sequence of experiments with both synthetic and real point data sets 
are studied. In the experiments, our algorithms always outperform the 
existing ones by fetching 70% less disk pages. In some settings, the saving 
can be as much as one order of magnitude. 



1 Introduction 

One of the most important operations in a spatial database management system 
is the fc-nearest neighbor search. The problem is: given a set S of n sites and a 
query point q , find a subset S' C S of fc < n sites such that for any sites p± £ S' 
and P2 £ S — S', dist(q,pi) < dist(q,p2). This is also called top-k selection 
query. The fc-nearest neighbor searching is used in a variety of applications, 
including knowledge discovery and data mining |EJ, CAD/CAM systems [fflj 
and multimedia database Ca- 
in this paper, we study the following case: the query point q is a moving point. 
We call it “fc-nearest neighbor search for moving query point” (A'-NNMP). As 
the following example illustrates, the problem arises naturally in mobile com- 
puting environment. 

Example } TB] [Mobile E-commerce]: Provide the following service for a 
moving car: tell the driver where the nearest gas stations are. In the above 
example, the car is a moving query point which changes its location continuously. 
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Gas stations are sites in set S. The sites are static and their positions are known 
in advance. The service is to provide the query point with real-time reports 
about the k nearest neighbors. 

A special case: 1-NNMP has been studied in Computational Geometry for 
quite a long time and the solution is trivial after Voronoi diagram is used. Given 
a site set S, the Voronoi diagram can be pre-computed in an efficient 0(?rlogn) 
algorithm [jjjJ where n is the number of sites in S. Then for a moving query point 
q , at each sampled position, the nearest neighbor can be quickly found in 0(log n) 
time by locating the cell of Voronoi diagram that contains q. Unfortunately, the 
Voronoi solution cannot be extended to general fc-NNMP problem because it is 
very difficult to generate a generalized k - Voronoi diagram. 

Before we start to search for solutions, we need to find a way to represent 
the movement of the query point. In most cases, the trace of a moving query 
point is a continuous curve in working space and it can hardly be described 
by a function within reasonable level of complexity. Thus periodical sampling 
technique is widely used: the time period is divided by n + 1 time stamps into n 
equi-length intervals. At each time stamp, which we called “sampled position”, 
the location information of the query point is collected. The location of the 
query point between two consecutive sampled positions is estimated using linear 
or polynomial splines. In this paper, we adopt this approach because it provides 
a good approximation when the time intervals are short. Later in this paper, 
without confusion, we assume that sampled position t , time stamp t, and time t 
are exchangeable. 

When periodical sampling technique is applied, in order to provide an accept- 
able solution for fc-NNMP problem, we need to find correct k nearest neighbors at 
every sampled position. A naive solution is: to launch a new fc-nearest neighbor 
search at each sampled position. Since the fc-nearest neighbor search operation 
is relatively expensive, this solution is inefficient. 

We propose a progressive approach in this paper. Our methods are motivated 
by the following observation: when two query positions q\ and q 2 are close, the 
results of the fc-nearest neighbor search Si and S 2 must be related. Once we have 
Si, S 2 can be achieved with much less work. To authors’ knowledge, the current 
research interest is mainly focus on designing a sophisticated index structure for 
sites so that the answer can be found quickly for any query points 0. 

One assumption is made in this paper: the information of sites is provided 
before query and the sites are stored in a R-tree-family structure. R-tree and 
its variants fZ] are excellent structures for indexing spatial data. The sites are 
treated as points in R-tree. 

Our main contributions include: 

— We review the fc-NNMP problem from a new point of view. Instead of looking 
for an efficient spatial index structure for sites, at each sampled position, we 
try to utilize the information contained in the result sets at the previous 
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sampled positions. In some cases, we can extract the correct answer just 
from the history data and no further search is necessary. 

— If the previous result set is obsolete, a new search directly from the sites 
is unavoidable. Our algorithms will provide a much better start point: the 
initial search bound is much lower than the pure static branch-and-bound 
algorithms. Thus, more nodes in the R-tree can be pruned without check- 
ing. In our experiments, we show that with this improvement, the total 
disk pages accessed by our algorithms are about 70% less than the existing 
ones. 

— To obtain the first several result sites quickly is another interesting problem 
in the fc-nearest neighbor search problem. In our progressive methods, the 
history result set is scanned first and those sites which, by our theorem, 
must be in the current result set are picked out. Therefore, in most cases, 
our algorithms find the first several result sites quickly. 

— If the next position of the query point can be precisely predicted, we further 
optimize our algorithm by introducing another buffer. The second buffer 
stores the site candidates for the next query point position during the search 
procedure. Therefore, the initial search bound is even lower. 

The organization of this paper is the following. In section 2, related work is 
introduced. In section 3 four algorithms are proposed. The experimental results 
are studied in section 4 and in section 5 conclusions and future research plans 
are presented. 



2 Related Work 



Recently, many methods have been proposed for fc-nearest neighbor search in 
static environment. They can be classified into two categories depending on the 
total rounds of searches on the site set na 

The first category is called single-step search. The methods in this category 
scan the site set only once. 

Roussopoulos et. al. G3] proposed a branch-and-bound algorithm for query- 
ing spatial points storing in an R-tree. Meanwhile, they introduced two useful 
metrics: MINDIST and MINMAXDIST for ordering and pruning the search tree. 
The algorithm is briefly described as following. 

— Maintain a sorted buffer of at most k current nearest neighbors. 

— Initially set the search bound to be infinite. 

— Traverse the R-tree, always lower the search bound to the distance of the 
furthest nearest neighbor in the buffer, and prune the nodes with MINDIST 
over the search bound, until all nodes are checked. 
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The performance of this algorithm was further investigated in m and the 
metrics were later used in Closest Pair Queries j3j. 

The other category is called multi-step search. The methods in this category 
scan the dataset multiple times until the proper radius is attained. 

Korn, et. al. proposed an adapted version of multi-step algorithm 0. The 
algorithm worked in several stages. First, a set of k primary candidates was 
selected based on stored statistics. After the candidate set was checked, an upper 
bound dm ax was found, which guaranteed that at least k sites were within the 
distance dmax from the query point q. Next, a range query was executed on site 
set to retrieve the final candidates. The range query was: select all sites in the 
range R from the site set where R = {o\dist(o, q) < d max }- The number of final 
candidates could not be less than k. At last, those final candidates were checked 
again and the final results were found. 

Seidl and Kriegel further extended the method m. In their algorithm, every 
time a candidate site was checked, d max was adjusted. They reported that their 
algorithm was optimal and the performance was significant improved. 

Chaudhuri and Gravano 0 adopted the same approach. Their contribution 
was to use histograms in their algorithm to make the first d m ax guess more 
accurate. 

Finally, the possibility of using Voronoi cells in nearest neighbor search was 
studied JJ and BBD-tree for approximated search in fixed dimensions was pro- 
posed 0. 

3 K-NNMP Problem 

The following symbols are used in this section: q is the moving query point and q t 
is the location of q at sampled position t. S is the site set which contains n = \S\ 
sites. At any time t, the sites in S are sorted in ascendant order based on their 
distance to qt- The site which is ranked at position i has the distance D t (i). 
By definition, we have D t ( 1) < D t (2) < ... < D t (n). We also define et to be a 
distance upper bound so that within this distance from q tl we are guaranteed to 
find at least k sites. By definition, D t (k) < et- 

All our four algorithms and the naive solution which we mentioned in the 
first section are constructed on a static k - nearest neighbor search algorithm. 
In this paper, we pick the static branch-and-bound algorithm m because in 
this algorithm, no database statistics are required. So its query performance is 
foreseeable and stable. 

Generally speaking, the moving pattern of objects (such as the pedestrians) 
is random rather than following some specific rules. Our first three algorithms 
are designed for the general cases. In some special cases, the next position of 
objects (such as an airplane) can be calculated. With the extra information, our 
methods can be further optimized. The fourth algorithm is for this purpose. 
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3.1 Fixed Upper Bound Algorithm 

The naive solution launches a new search at every sampled position with static 
branch-and-bound algorithm and the search bound is initially set to be infinite. 
If we have no extra information, this step is essential for finding the correct 
answer. But in the fc-NNMP problem, the search result at the previous sampled 
position may give us some clues for a smaller initial guess. It is an evident fact 
that with a smaller initial search bound, the static branch-and-bound algorithm 
becomes more efficient. In this part, we want to find a smaller search bound et+i 
from the existing D t (i) where 1 < i < k. 




Fig. 1 . et+i does not need to be infinite 



Look at Figure Q1 at time t, the query point is at q t . Within the solid circle, 
there are k sites. At time t + 1, the query point moves to qt+i, the distance 
between q t and q t + 1 is S. All the sites whose distances to q t +\ are less than or 
equal to e t+ i are in the dashed circle. By definition, this e t+ i is legal if and 
only if within the dashed circle, there are at least k sites. This restriction can 
be easily met if the dashed circle encloses the solid circle. We have the following 
theorem. 

Theorem 1. Suppose at time t, the k nearest neighbors of query point at loca- 
tion q t are {pi,P2, ■ ■ ■ ,Pk} and D t {k) is the maximum distance of these sites to 
q t - At time t+ 1, the query point moves to q t+ 1 - We claim that e t+ i = D t (k) + S 
is legal, where 6 is the distance between qt and qt+i- 

Proof. By definition we have: 

distance(pi, qt ) < D t {k ) (Vi < k) 
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According to triangular inequality, we have: 

distance(pi, qt+i) < D t {k ) + 5 (Vi < k) 

which means that there are at least k sites whose distances to qt+i are no more 
than D t (k ) + S. By definition, we know that et+ 1 = D t (k ) + 5 is legal. 

The fixed upper bound search algorithm works as follow. At sampled position 
1, after the routine static bound-and-branch search, the location of the query 
point is recorded as well as D\(k). Later at sampled position t > 1, after the 
position of query point q t is obtained, the initial search bound is calculated 
according to the above theorem and a new static search is launched. After the 
search, the values of D t (k) and q t are stored and prepared for the actions at the 
next sampled position. 

3.2 Lazy Search Algorithm 

In the fixed upper bound search algorithm, a new search is launched at each 
sampled position, even though the movement of the query point is too small 
during that time period to cause any changes in the result. In this part, we give 
a lazy search algorithm and show that in some cases a new search is unnecessary. 

After observing most static fc-nearest neighbor algorithms, we find the fol- 
lowing fact: at time t, when we search for the k nearest neighbors, D t (k+ 1) can 
always be obtained for free or with little extra work. 

For example, in the adapted multi-step fc-nearest neighbor search algorithm 
0, at the second round, most of time, the result of range query contains more 
than k elements which need further check at the last step. To get the value of 
D t (k + 1), we simply record the distance of the first follow-up loser. Look at 
another example, in static bound-and-branch algorithm US. if we maintain a 
sorted buffer with size k + 1 instead of k, after the whole R-tree is scanned, the 
value of D t (k + 1) is the distance between the query point and the site which we 
stored in the extra buffer. 

With the knowledge of D t (k + 1), we have the following theorem: 

Theorem 2. Suppose at time t, the k nearest neighbors of query point at lo- 
cation qt are {pi,p 2 , ■ . ■ ,Pk}, D t {k ) is the maximum distance of these sites to 
qt and D t {k + 1) is the minimum distance of the sites outside that set. At time 
t + 1, the query point moves to qt+i, we claim that the result set will be the same 

if 

. D t {k + 1) - D t (k) 

S ~ 2 

where S is the distance between q t and q t + 1 - 

Proof. By definition we have: 

distance(pi,qt) < D t {k ) (Vi < k) 
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After the query point moves to qt+i, by triangular inequality, we have: 
distance{pi, qt+i) < D t {k ) + 8 (Vi < k) 

For any site p not in this set, by triangular inequality, we have: 

D t {k + 1) — S < distance(p , g t+ i) 

When 8 < 0+i)-.Dt ( fc ) , f or an y s jt e p no t j n se t, W e have: 

distance(pi, q t + 1 ) < D t {k) + <5 < D t (k + 1) — 6 < distance(p , g*+i) (Vi < k) 
That means {p\,p 2 , ■ ■ ■ ,Pk} is still the correct result set. 

A very intuitive fact is illustrated in the above theorem: when the movement 
of the query point is small, the query result will not change. 

Another important issue in static nearest neighbor query is to get the first 
solution quickly. In our progressive algorithm, even though we have to start a 
new search, some sites in the previous result set will still appear in our new 
result. It makes us to think about the following problem: which part in the 
previous result will still be in the new search result after the query point moves 
a distance 81 The following theorem is a solution. 

Theorem 3. Suppose at time t, k nearest neighbors to query point at location 
qt are {p\,P 2 , • • ■ ,Pk}, D t {k ) is the maximum distance of these sites to qt and 
D t (k + 1) is the minimum distance of the sites outside that set. At time t + 1, 
the query point moves to q t + 1 , we claim that the new result still contains sites 
Pi with D t (i ) < D t (k + 1) — 28, where 8 is the distance between q t and qt+i- 

Proof. From the previous theorems, we know that at time t + 1, D t +i(k + 1) > 
D t (k+ 1) — <5. For each site pt in the result set, by triangular inequality, we have 
dist(q t +i,pi) < D t (i) + 8. Put these two together, if D t (i) < D t (k + 1) — 28 
holds, we have 

distance(q t +i,Pi ) < D t {i) + 8 < D t (k + 1) — 8 < D t+ \(k + 1) 
which concludes the proof. 

Here is the lazy search algorithm. At sampled position 1, the following infor- 
mation is maintained: the correct query result set, the position of query point, 
Di(k) and D\(k + 1). Later at sampled position t > 1, the first step is to check 
whether the buffered query result is still correct (theorem El). If so, return the 
result. Otherwise, find the “still-correct” sites (theorem |3J) . Then calculate the 
initial et (theorem Q) and start a static search. After the new result set is found, 
store the result set into our buffer, along with the value of D t {k ), D t (k + 1) and 
qt', and wait for the next query. 
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Fig. 2. In some cases, there is no need to update the buffer 



3.3 Pre-fetching Search Algorithm 

Pre- fetching technique is widely used in disk-bounded environments. Comparing 
with the lazy search algorithm, this time, we go one step further. We execute 
m-nearest neighbor search at some sampled positions where in > k, the results 
are stored in a buffer which is maintained in memory. At the remaining sampled 
positions, we only check the buffer information and do not search the disk. 

One problem remains for this algorithm: since the buffer stores the m nearest 
neighbors at previous query position, when we should updated the contents in 
the buffer? We propose the following theorem as a solution. 

Theorem 4. Suppose at time t, the m nearest neighbors of query point at loca- 
tion qt are stored in the buffer, where m > k. D t {k) and D t (m) are the k-th and 
m-th distance among those sites. Later the query point moves to a new location 
q' . We claim that there is no need to update the contents in the buffer if 

s < D t{m ) ~Dt(k) 

2 

where S is the distance between qt and q' . 

Proof. As indicated by theorem [El when query point is at q ' , e(q') is valid if 
e(q') > D t (k) + S. Look at Figure 0 the information of the sites which are in the 
dashed circle is stored in our buffer. It is clear that there is no need to update the 
contents in the buffer if and only if the dashed circle contains the doted circle. 
That is: e(q') < D t (m) — 6 . Combine this with the conclusion in theorem 0 we 
have 



D t {k) + S < e(q') < D t (m) - S 
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5 < AM -Dt(k) 

2 

Theorem El tells us when the contents in the buffer are obsolete and required 
to be updated. Observing the conclusion, we find that the lazy search algorithm 
is a special case of this algorithm if we set m = k + 1. There is no corresponding 
theorem about first solution in this algorithm. But it would be very helpful if 
we inspect the buffer and check the sites whose distances to q' are less than or 
equal to Dt ^ s before a new search. 

Here are the details of the pre-fetching search algorithm: at sampled position 
1, an extended m nearest neighbor search algorithm is executed. The results 
are stored in a buffer. Among them, the k nearest neighbors are identified. The 
information about D\{k), Di(m) and the query position are stored too. Later 
at sampled position t > 1, after the position of query point qt is obtained, our 
first check is whether the results are in our buffer (theorem 0). If the answer 
is yes, we only need to examine the sites in the buffer; if the answer is no, a 
new static m-nearest neighbor search is launched. After the search, the value 
of D t (k), D t (m) and qt are stored and prepared for actions at later sampled 
positions. 

3.4 Dual Buffer Search 

In some applications, the next position of the query point can be partly pre- 
dicted. For example, in an air-traffic control system, the moving query point is 
an airplane. At any given time t , qt+i can be calculated based on the speed of 
object and the length of time interval. However, the prediction of qt+i cannot 
be 100% correct because of many unforeseeable reasons. 

In this paper, we adopt the assumption about the worst-case sampling error 
of the prediction of the position made in suppose at sampled position t, 
the query point is at qt . The prediction of the position is a pair of data (q' t+l ,r) 
so that the next position q t +i has the same possibility to be within the circle 
around q' t+ i, and r is the radius. Here q' t+1 is called “predicted next position” of 
the query point and r is called “uncertainty factor”. 

Here is the basic idea of this algorithm. If during the search procedure at 
time t, for each site candidate, we also check the distance from the site to q' t+1 
and store the k best sites in another buffer, then at the next sampled position 
t + 1, we may have a better start. 

The details of the dual buffer search algorithm is as following. There are two 
buffers with size k in the system. The first buffer is for the site candidates of the 
current position and the second one is for the candidates of the next position. At 
sampled position 1, we do a routine fc-nearest neighbor search. Meanwhile, for 
each site we check, we also calculate the distance of the site to the predicted next 
position of query point q' 2 . Among all the checked sites, the k current nearest 
neighbors are maintained in the first buffer and the k best results to q' 2 are stored 
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in the second buffer. Later at sampled position t > 1, the sites in the second 
buffer are moved to the first buffer and the maximum distance of the sites to qt 
in the buffer is used as the initial search bound. At the same time, in the second 
buffer, the k sites are still kept. This time, they are sorted by the distances to the 
new predicted next query position q' t+ i- Then, a new fc-nearest neighbor search 
is launched and both buffers are updated. 

With the second buffer, the initial search bound may be lowered. 

Theorem 5. In the dual buffer search algorithm, at any sample position t, the 
real initial search bound (the maximum distance of the sites in the second buffer 
to qt) is no greater than r plus the predicted initial search bound (the maximum 
distance of the sites in the second buffer to q' t ). If r = 0, at all sampled positions, 
the initial search bound in the dual buffer search algorithm is always no greater 
than that in the fixed upper bound algorithm. 

The proof is obvious and is omitted in this paper. In real life, r cannot always 
be zero. If r is large, the real initial search bound may be even worse than that 
in the fixed upper bound algorithm. In the experiment section, we will use an 
experiment to show the impact of r. 



3.5 Discussion 

In this part, we give some discussion about the parameters that affect the per- 
formance of our four algorithms: the fixed upper bound algorithm, the lazy 
search algorithm, the pre-fetching search algorithm and the dual buffer search 
algorithm. 

The fixed upper bound algorithm always launches a search at new sampled 
position with a better initial search bound. This guarantees that in any situa- 
tion, it works better than the static solution because more nodes are pruned in 
the algorithm. The dual buffer search algorithm adopts the similar idea. If the 
prediction of the next position is precise, the initial search bound will be even 
lowered. 

The cost of a new search in the lazy search algorithm is a little more expen- 
sive than that in the fixed upper bound algorithm. The reason is mainly because 
in the lazy search algorithm, extra information, D(k + 1), is collected. For the 
same reason, a new search in the pre-fetching search is even more expensive. 
Fortunately, we do not have to launch a new search at every sampled position 
in both algorithms. According to our theorems, at some sampled positions, us- 
ing the information in our buffer is enough. We call these positions “no search 
positions” . It is obvious that the more “no search positions” we have, the better 
performance the algorithms reach. 

At least three factors affect the performance of the lazy search algorithm: 
number of sites, speed of the moving query point and k. Here we assume that 
the sites are distributed uniformly. 
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According to theorem^! a sampled position is a “no search position” if during 
the time period, query point moves a small distance <5 < Pt (fc+i)--P t (fc) _ wii en 
the number of sites is large, D t (k + 1) — D t {k ) will be small; when the speed of 
the moving query point is fast, S will be big. In both cases, the number of “no 
search positions” decreases. 

The relation between k and the performance is a little more complicated. By 
definition, we know that D t (i) is a non-decrease sequence. At the same time, 
we must also point out that when sites are in uniform distribution, D t (k) is 
proportional to It means that the value of D t (k + 1) — D t (k) reduces as k 
grows. So does the number of “no search positions”. 

Besides the above three parameters, another one that is important for the 
pre-fetching search algorithm is the buffer size. There is a tradeoff here: main- 
taining a small pre-fetched buffer lowers the cost for individual disk search, and 
maintaining a large one generates more “no search positions”. In the next sec- 
tion, we will use some experiments to demonstrate the impact of the buffer size. 

In sum, the progressive methods are better than static methods. Among 
them, the lazy search algorithm and the pre-fetching search algorithm have bet- 
ter performance for small site number, slow moving speed of query point, and 
little k. This will be showed in our experiments. 

4 Experimental Result 

To access the merit, we performed some experimental evaluations of different 
methods on both synthetic and real data sets. 

4.1 Experiment Settings and Methods Selection 

In our experiments, we use a unit square [0, l) 2 as working space. In both syn- 
thetic and real data sets, the sites are normalized into the unit space. All sites 
are organized in a R-tree 0. 

Along with the naive solution and our four algorithms, we also test the query 
performance of the brutal search algorithm. We choose brutal search algorithm 
because it is the only method available if the sites are not indexed. In the pre- 
fetching search algorithm, we select three different buffer sizes: twice, five times 
and ten times as much as the result size. 

The abbreviations of algorithms used in this part are showed in Table d 

Experiments were performed on a Sun SPARCstation 5 workstation running 
on Solaris 2.7. The workstation has 128 M Bytes of main memory and 4 G 
Bytes of disk. The page size is 4 K Bytes for both disk I/O and R-tree nodes. 
In real-world data sets, the information of each site consists of its id, name, 
location, etc. The size of each record is 100 bytes, which leads to 40 points per 
disk page. In synthetic data sets, the size of each data is 20 bytes (4 bytes for 
its id and 8 bytes for each dimension) which lead to 200 points per disk page. 
The experiments were written in JAVA 2.0 
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Table 1. Algorithm abbreviations 



BS 


Brutal Search Algorithm 


SBB 


Static Branch-and-Bound Algorithm (naive solution) 


FUB 


Fixed Upper Bound Algorithm 


LS 


Lazy Search Algorithm 


PS-m 


Pre-fetching Search Algorithm with buffer size m x k (For example, 
PS-2 means the buffer size we choose is twice as much as k.) 


DBS 


Dual Buffer Search algorithm 



4.2 Data Sets 

The real-world data sets we used come from TIGER (2j. The particular three 
data sets were 120 hospitals, 1982 churches and 1603 schools in Maryland, USA. 
Figure 00 and El show the site distribution of each set. 




Fig. 3. Hospital distribution 




Fig. 4. Church distribution 



The synthetic data set contains a large number of uniformly distributed sites 
in working space. 

In our experiments, we measure the performance of various algorithms based 
on the number of disk pages they accessed. In most cases, nearest neighbor search 
problem is disk-bounded and the computational cost is trivial comparing with 
the I/O cost. The average number of disk pages each method accesses at every 
sampled position provides a direct indication of its I/O performance. For LS, 
PS-2, PS-5 and PS- 10, we also record the percentage of “no search position”. 
This one provides useful information about the reason why LS and PS-m work 
well in some cases. 

4.3 Query Sets 

The moving queries are generated as following. First randomly pick a point in 
working space as the start point. Then we define a speed for the moving object. 
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Fig. 5. School distribution 



The moving object keeps that speed but with a 10% error. For example, if 
the defined speed is 0.001, the actual speed of the moving object is randomly 
picked from 0.0009 to 0.0011. At each sampled position, we calculate the current 
position of the moving object and execute the algorithms. In each continuous 
query, the time is divided into 1,000 intervals. The results we listed are the 
average values of more than 100 queries. 



4.4 Experimental Results Using Synthetic Data Sets 

We use the synthetic data sets to study the parameters that affect the perfor- 
mance of algorithms: r, site number, speed of moving object and k. 

In the first experiment, we want to compare the initial search bound in FUBS 
and DBS with different “uncertainty factor” r. To make the result clear, we only 
display the ratio numbers (i.e. the initial search bound / D{k)). The lower the 
ratio number is, the better the algorithm will be. The other parameters are fixed: 
k = 10, speed = 0.0001 and the site number is 100,000. r varies from 0 to 0.5 x 6 
where S is the actual distance between two consecutive query positions. In figure 
O y-axis is the ratio number and axaxis is the value of r/S. 




0 0.1 0.2 0.3 0.5 



Fig. 6. The initial search bound of FUBS and DBS with different r 




92 



Z. Song and N. Roussopoulos 



Figure 0 shows that DBS has smaller initial search bound only when the 
value of r is low (less than 15% of 8). As r grows, i.e. the prediction becomes 
not very precise, the performance of DBS is worse than FUBS. That means if 
the movement pattern of the object is random-like, the prediction information 
is not helpful but has negative impact on the performance. In the following 
experiments concerning DBS, we fix r to be 0.1 x 8. 

In the second experiment, we vary the site number from 1,000 to 1,000,000 
to compare the performance of different algorithms. The other two parameters 
are fixed: k = 10 and speed = 0.0001. 





T,. - , , Fie. 8. Percentage of no search position vs 

i lg. 7. Disk page access vs. site number . ° ^ 



In Figure 0 x-axis is the number of sites we indexed for the queries and 
y-axis is the average pages accessed for each method at every sampled position. 
In Figure 0 y-axis is the percentage of “no search positions” for four algorithms 
with pre-fetching information. 

The following facts are showed in Figure 0 and Figure 0 

1. R-tree grows when the site number increases. All methods require checking 
more disk pages for larger site number. 

2. FUB always outperforms SBB because of its good initial search bound. 

3. Due to the lower initial search bound, DBS always beats FUBS by checking 
about 10% less nodes. 

4. The percentage numbers of “no search position” for LS, PS-2, PS-5, and 
PS-10 are more than 80% when the site number is 1,000. Among them, the 
numbers of PS-5 and PS-10 are close to 100%. The outcome is that their 
performances are much better than other three opponents. 

5. As the site number increases, the percentage numbers drop quickly. When 
the site number is 1,000,000. The numbers of all four algorithms become 
0%. That means the information we pre-fetched is totally useless. At this 
moment, the bigger buffer size we choose, the worst performance it becomes. 
PS-10 and PS-5 even access more disk pages than the naive solution. 
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6. For the pre- fetching search algorithms with large buffer size (PS-5 and PS- 
10), the performance easily becomes very bad if the percentage of “no search 
position” decreases. It means that choosing large buffer size for pre-fetching 
search algorithm sometimes may be very dangerous. 

In the next experiment, we vary k from 1 to 100 and fix site number to be 
100,000 and speed to be 0.0001. The result is showed in Figure El and E3 £-axis 
is the value of k. 




Generally, all methods except BS check more disk pages when k grows, and 
FUB always outperform SBB by a factor of 3. DBS has similar curve as FUB 
but the corresponding value is about 10% less. LS and three PS-to algorithms 
follow the same pattern as in the first experiment. The little difference is the 
percentage number is not exactly 0 when k = 100. In three pre-fetching search 
algorithms, increasing buffer size brings very little benefit when k = 100. (Double 
the buffer size from PS-5 to PS-10 only adds 0.2% to the percentage of “no search 
position”.) It is worth noting that even with such a small percentage number, 
LS still outperforms FUB. 

Finally, we vary the query point speed from 0.00005 to 0.001 and fix the 
point number to be 100,000 and k to be 10. ir-axis is the value of speed. 

As Figure El and El show, the performance pattern remains to be the same. 
This time, the percentage of “no search position” drops much faster. PS-10 and 
PS-5 win only when the speed of query point is very slow, and lose to most 
algorithms in the rest cases. 

In conclusion, FUB, DBS and LS always outperform the pure static method 
by a factor of 3 except for very small site number. When the prediction of the 
next position of the query point is precise, the extra information helps DBS to 
beat FUB by accessing about 90% of disk pages. LS is better than FUB in most 
cases when site number is not too large and speed of query point is not too fast. 
PS-to has the best performance when the percentage of “no search position” is 
large, but lost to FUB and LS in most situations. The pre-fetching algorithms 
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Fig. 11. Disk page access vs. speed 



Fig. 12. Percentage of no search period vs. 
speed 



with large buffer size are more sensitive to parameter change than ones with 
small buffer size. 

4.5 Experimental Results Using Real Data Sets 

In real-world data sets, we want to simulation the following situation: a car moves 
at a normal speed (65 mph or 105 km per hour with 20% error) in Maryland, 
USA. The route of the car is randomly selected. A real-time report about the 
10 nearest sites is provided and the time interval is one minute. 

In the experiments, we test three data sets separately and in the fourth 
experiment, we put these site data together. 




Fig. 13. Disk page access of different methods in four data sets 



Figure El compares the number of R-tree nodes fetched from disk by each 
algorithm at every sampled position. In the hospital dataset, since the site num- 
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ber is too small (only about 100), the improvement of our algorithms is not very 
clear. In the rest three cases, the proposed FUB, LS, PS-?n and DBS algorithms 
access a much smaller number of R-tree nodes than SBB and BS methods. 

5 Conclusion 

We studied the fc-nearest neighbor search problem on moving query point and 
proposed progressive methods which minimize the query operation by using in- 
formation of previous query and pre-fetched results. Our idea is easy to be imple- 
mented and can be combined with any single-pass or multiple-pass static nearest 
neighbor search algorithms, although we use static branch-and-bound algorithm 
in this paper. All the techniques we presented boost the query performance 
greatly. 

In this paper, the blind pre- fetching algorithm buffers a lot of useless points. 
If we have the information about the path of the query point, how to efficiently 
buffer the pre-fetch results is a challenging problem. 
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Abstract. A method is presented in this paper for answering location- 
dependent queries in a mobile computing environment. We investigate a 
common scenario where data objects (e.g., restaurants and gas stations) 
are stationary while clients that issue queries about the data objects are 
mobile. Our proposed technique constructs a Voronoi Diagram (VD) on 
the data objects to serve as an index for them. A VD defines, for each 
data object d, the region within which d is the nearest point to any mo- 
bile client within that region. As such, the VD can be used to answer 
nearest-neighbor queries directly. Furthermore, the area within which 
the answer is valid can be computed. Based on the VD, we develop a 
semantic caching scheme that records a cached item as well as its valid 
range. A simulation is conducted to study the performance of the pro- 
posed semantic cache in comparison with the traditional cache and the 
baseline case where no cache is used. We show that the semantic cache 
has a much better performance than the other two methods. 

Keywords: mobile computing, location-dependent query, Voronoi Di- 
agrams 



1 Introduction 

With the advance of wireless networks and the popularity of portable electronic 
devices, mobile computing has been one of the hottest topics in computer science 
research in the past several years. Mobility has created new challenges to the 
existing computing infrastructures, such as databases, networks and so on. In 
the database research area, for example, data models must support the notion of 
user and data mobility as first-class data types. Furthermore, database systems 
must be able to represent location information efficiently to support complex 
location-dependent queries. 
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The fact that clients in a mobile environment can change locations opens up 
the possibility of answering queries in a way that is dependent on the current 
position of the client [I]. These kinds of queries are called location- dependent 
queries. Examples of location-dependent queries are “find the nearest gas station 
to my current location,” “find all the cinemas within a 1 km radius,” “which 
buses will pass by me in the next 10 minutes?” , and so on. While the data objects 
in the first two examples are stationary, they are mobile in the last example. 

The provision of location-dependent information for the same user at differ- 
ent locations is a challenging problem. In addition, queries should be processed 
in such a way that consumption of wireless network bandwidth and battery 
power of the portable client is kept to a minimum. Techniques such as broad- 
cast, caching, and indexing have been developed for this purpose. The focus of 
this paper is on location-dependent queries. Since users are mobile in a mobile 
computing environment, location-dependent queries must be answered accord- 
ing to the user’s current location. For example, “find the nearest restaurant” 
would return totally different answers to the same user when the query is is- 
sued at different locations. If a user keeps moving after he /she submits a query, 
the problem becomes more complicated because the user’s location is changing 
continuously and thus the results would change accordingly. How to answer a 
continuous query and provide an accurate answer is an open question. 

Although a lot of research work been done in this area, methods for answering 
location-dependent queries efficiently and also guaranteeing the validation of 
the answer are not available. Our method makes use of Voronoi Diagrams to 
preprocess the data in order to answer location-dependent queries quickly, and 
a semantic cache to validate the answer. 

Section 2 of this paper gives the definition of location-dependent queries, 
describes the difficulties of solving them, and outlines some existing work done 
in this field. Our method is described in Section 3. Section 4 gives the simulation 
and performance evaluation results. Finally, the summary and future work of this 
project are given in Section 5. 



2 Related Work 

In order to support location dependent queries, some basic requirements must 
be met: 

1. Locating the user: The prerequisite of answering location-dependent queries 
is to locate the user. Available technologies that can provide user locations 
include GPS (Global Positioning System) and cellular wireless networks. 
Since these technologies have inherent errors, applications demanding high 
precision must take into account the location errors during query processing. 

2. Maintaining the mobility of moving objects: Since the location information 
of moving objects keeps changing all the time, storing the location in the 
database and updating it whenever it changes is not an acceptable solution 
because of the frequent updates required. Some dynamic, intelligent repre- 
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sentation should be adopted in order to reduce the number of updating to 
the database. 

3. Predicting future movement trends: There is a time interval between the 
submission of the query and the return of the answer to the user. As such, 
prediction and validation must be done in order to guarantee that the answer 
is correct at the time when it is received by the user (not just at the time 
when the answer is computed). Also, there are queries about future situations 
such as forecasting traffic conditions of particular areas. In order to answer 
these kinds of queries, some future information should be predicted from 
current information. 

4. Processing queries efficiently: Due to the expensive bandwidth of the wireless 
network and the limitations of computing resources for portable devices, 
query processing must be done efficiently. Technologies such as caching, data 
replication, and indexing can be used to improve efficiency. 

5. Guaranteeing the boundaries of the precision: Neither the position of a cur- 
rent location nor the prediction of a future location is 100% accurate, so 
some mechanisms should be put in place to provide a bound to the error. 

It is clear that many research problems have to be addressed and resolved 
before these requirements can be met satisfactorily. In this paper, we focus on 
the query processing aspect of location-dependent queries. A brief summary of 
work related to query processing is given in the following subsections. 

2.1 Caching 

The client cache stores frequently used information on the client so that queries 
can be answered without connecting to the server. In addition to providing 
answers quickly, the cache can also provide a limited level of services when a 
connection is not available. 

— Data caches: Just like a traditional cache in a database, a data cache stores 
recently accessed data in the mobile client in order to save wireless bandwidth 
and improve access time. For location-dependent information such as local 
traffic information, cached data should also be validated when the client 
changes location. Xu et al. proposed a bit-vector approach to identify the 
valid scope of the data, and investigated a couple of advanced methods of 
validating caches based on this approach ca- 

— Semantic caches: A semantic cache stores data and a semantic description 
of the data in the mobile client |3J. The semantic description enables the 
cache to provide partial answers to queries which don’t match the cache data 
exactly. As such, wireless traffic can be reduced and queries may be answered 
even in the case of disconnections. This characteristic makes a semantic 
cache an ideal scheme for location-dependent queries. A cache method was 
proposed in jrnj. a tuple S = (Sr, Sa, Sp , S p , Sc) was used to record data 
in the local client. Sr and Sa are, respectively, the relationships and the 
attributes in S', Sp is the selection conditions that data in S satisfy; Sl is 
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the bound of the location; and Sc represents the actual content of S. When 
a query is received by the client, it is trimmed into two disjointed parts: a 
probe query that can be answered by some cached data in the client, and a 
remainder query that has to be transmitted to the server for evaluation. 



2.2 Continuous Queries 

A location- dependent query becomes more difficult to answer when it is submit- 
ted as a continuous query. For example, a client in a moving car may submit 
the query: “Tell me the room rate of all the hotels within a 500 meter radius 
from me” continuously in order to find a cheap hotel. Since the client keeps 
moving, the query result becomes time-sensitive in that each result corresponds 
to one particular position and has a valid duration because of location depen- 
dency. The representation of this duration and how to transmit it to the client 
are the major focuses of Continuous Queries (CQ). Sistla et al. employed a tu- 
ple (S', begin, end) to bound the valid time duration of the query result j 1 31 1 4| . 
Based on this method, they also developed two approaches to transmitting the 
results to the client: an immediate approach and a delayed approach. The former 
transmits the results immediately after they are computed. Thus, some later 
updates may cause changes to the results. The latter transmits S only at time 
begin, so the results will be returned to the client periodically, thus increasing 
the wireless network burden. To alleviate the limitations of both approaches, 
new approaches, such as the Periodic Transmission (PT) Approach, the Adap- 
tive Periodic Transmission (APT) approach and the Mixed Transmission (MT) 
Approach, were proposed m- 

2.3 Query Types 

According to the mobility of the clients and the data objects to be queried by 
the clients, location-dependent queries can be classified into three types: 

— Mobile clients querying static objects: Queries like: “Tell me where the near- 
est gas station is” and “Where is the nearest restaurant?” are popular queries 
in real-world applications. In general, the clients submitting this kind of 
queries are mobile and the data objects are fixed. The main challenge of this 
type of queries is how to get the locations of the clients and also guarantee 
the validation of the results when the client keeps moving during the query 
evaluation process. Queries such as “Report all the available hospitals within 
a 500-meter radius” are an extension of this type of query. 

— Stationary clients querying moving objects: An example of this type of query 
is “Report all the cars that pass gas station A in the next 10 minutes.” Here, 
gas station A is static and the moving cars are the objects being queried. 
The result only depends on the location of the moving cars. Actually, this 
is an extension of traditional database queries with dynamic answers to the 
same query. This query is considered as a location-dependent query because 
the data objects are all mobile and the answer to the query depends on the 
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locations of the data objects. Consequently, this type of query requires good 
representation of moving objects and efficient indexing techniques. Usually, 
this type of query is interested in information about the future, so later 
updates will cause some objects to become unqualified for the query condi- 
tion. As such, the real-time guarantee of the validity of the answer is also 
a challenge. In an earlier paper i2jj, the authors mentioned the invalidation 
of results containing location-dependent data caused by later updates, and 
considered that objects that satisfy the query at one moment might become 
disqualified after being updated. 

— Mobile clients querying mobile objects: That both the clients submitting the 
queries and the data objects are continuously moving is the main character- 
istics of this kind of query. For example, a query of this type: “Tell me all 
the cars that will pass me after 20 minutes” while the client is driving on a 
highway, is very complicated since it is a combination of the first two types. 
Even a simple query like: “Tell me all the cars that will pass me within the 
next 20 minutes” bears the characteristics of CQs. 

Of course, all the queries listed above can also contain conditions querying 
about location-independent attributes, such as: “Tell me the nearest restaurant 
providing Chinese food.” Since these queries can be broken down into two parts: 
one for location-dependent information and the other for location-independent 
attributes, we only consider queries on location-dependent information as the 
others can be retrieved by traditional query-processing methods. 

Most, if not all, of the location-dependent queries can be categorized as one 
of the three types described above. We can analyze each type separately in order 
to define the scenario clearly and simplify the problem. The rest of this paper 
will focus on the first type since some research on the second type has already 
been done 03 , and research on the third type can be simplified if solutions 
to the first two types have been found. However, to the best of our knowledge, 
no previous research has been done on type one queries. The remainder of this 
paper will introduce a technique in order to solve the first type of query. 

3 Voronoi-Diagram-Based Indexing 

From the example queries in the previous section, we can see that queries to 
find the nearest service are very useful and popular. In this section, we present 
a method for retrieving the nearest service efficiently in a mobile environment. 
An assumption of our work is that the location of a client is known from GPS. 
When a user submits a query, its current location and speed together with the 
timestamp are also submitted. 

3.1 Basic Data Structure 

Our method makes use of Voronoi Diagrams (VDs), which, by nature, are suit- 
able for nearest-neighbor queries. A Voronoi Diagram records information about 



102 B. Zheng and D.L. Lee 



what is close to what. Let P = { pi,p2 , • ■ ■ ,p n } be a set of points in the plane 
(or in any n-dimensional space), each of which we call a site. Define V(pi), the 
Voronoi cell for p,, to be the set of points q in the plane such that dist(q,Pi) < 
dist(q,pj). That is, the Voronoi cell for pi consists of the set of points for which 
Pi is the unique nearest neighbor of q: 

V(p») = {q\dist(q,pi) < dist(q,pj),Wj ± *}. (1) 

As shown in Figure^ all the points in the shadowed region Area\ have the same 
nearest fixed point, namely, 0\. 




Fig. 1 . Voronoi Diagrams used in nearest neighbor queries 



Since a VD is a traditional data structure in computational geometry, ad- 
equate structures for storing a VD and efficient point location methods for lo- 
cating a point in a region are available [3j. As such, we assume without further 
description that a standard structure for representing the information in a VD 
and a corresponding location method are available. 

In the scenario of finding the nearest restaurant, the restaurants are the fixed 
sites and the mobile clients are the points needing to be located. Once the mobile 
client is located in one area, the unique fixed site of this region is its nearest 
neighbour. The good thing about this type of query is that all the services are 
fixed and only change occasionally. Furthermore, the location information of 
the services is available before query submission. Thus, preprocessing can be 
done to construct the corresponding VD of different services. Three data struc- 
tures are used to record the preprocessed data. The first one is edge, denoted by 
(id, X\, j/i , X2, 1/2}, which is used to record the segment id and the endpoints of a 
segment. The second one is service object; it records the position of the service 
object (i.e., the site in the definition of Voronoi Diagrams) and its bounding 
edges. It is a tuple (id, x,y, number, list) , where x and y are the coordinates 
of the site, number is the number of edges bounding this site, and list is the 
list that records the ids of all the edges. The last one is edgeservice, which 
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records the information between the service objects and the edges using a tu- 
ple (segment Jd, serv-objectJdi, serv-objectjid 2 ) ■ Since the return set of point 
location algorithm that we use is the edge that is just below that point, the map- 
ping relationship between the edge and corresponding site should be maintained. 
id\ and icfe are the sites above and below this edge respectively. Currently, this 
method is just like the traditional static data cache for nearest neighbor queries, 
but there are two major differences: 

1. The client submitting the query is always mobile, so the query processing 
method should handle the validation of the answers while the location of the 
client changes. It should also predict the possible future answer according to 
the direction and speed of the mobile client, i.e., the result of our method is 
dynamic while the traditional answer is static. 

2. Our method supports the semantic cache that is not provided in a traditional 
database. 

3.2 Data Preprocessing 

Although VD is the most suitable mechanism to find the nearest neighbor, it 
is seldom used in real applications because of the expensive maintenance of 
the structure when updating occurs. Fortunately, in the context of this paper, 
the chance of having to change a real service (moving or rebuilding) is very 
small. Furthermore, a large geographic area is likely to be divided into small 
regions, each of which is usually a cell covered by a base station in the wireless 
system. Within each region, we can classify the services into different kinds 
such as restaurants, hospitals, gas stations, etc. For each kind of service, a VD 
index is constructed based on the data structures described above. Overall, the 
maintenance cost of a VD index is reasonable considering the gain in query 
performance. 

3.3 Query Processing and Semantic Caching 

Now we consider how to answer the query: “Tell me where the nearest restaurant 
is.” When answering this query, we report the nearest restaurant from the VD 
index. Based on the known speed of the client, the next nearest restaurant can 
also be predicted. As denoted in Figure 0 point P is the mobile client and the 
arrow indicates the direction of the client’s movement. Currently, the nearest 
restaurant to P is 0\ and after P crosses the line between 0\ and O 4 , the 
nearest restaurant should be O 4 . Knowing the length of the dash line and the 
speed of P, we can estimate the time when P crosses the line, say, two minutes 
later. Then the answer should be presented as (0 i, 2,0 4 ). The first object of 
this tuple is the answer according to the current location of client P, the second 
element is the time interval during which the first answer is valid, and the last 
one is the predicted new answer after the valid time interval. 

One thing to notice is that the client may change its speed and direction. 
Thus, the valid duration of the first answer cannot be guaranteed using this 
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Fig. 2. Semantic Caches in Voronoi Diagramss 



mechanism. One way to prevent any false answers is to resubmit the query when 
the client changes its speed or direction. Here we propose a better alternative that 
uses the maximum speed and the shortest distance to guarantee a valid duration. 
The shortest vertical distance between the current position of the client and the 
bounding edges of this site divided by the maximum speed of the mobile client 
can be guaranteed to be valid. This method may make the valid duration of the 
answer shorter than the real one, but it will not produce any invalid results. 

A circle in Figure El should also be noted. This is the maximal circle having 
P as its center and the whole circle is within the region of 0\. Actually, the 
radius of this circle is the shortest distance that we used in our scheme to get 
the valid duration of the first answer. We store the location of P, the radius 
of the circle, and also 0\ in the client cache as (P.x, P.y, radius, O i). From the 
above example, we can see that the cache actually contains information about 
many circles. If the position of the client submitting the query is within one of 
the circles, then the nearest restaurant within this circle is the answer to the 
query, which can be answered without connecting to the server. For simplicity, 
the predicted answer is omitted for clients who can get the result from a local 
cache. In other cases , we should transmit the whole query to the server and 
create one new record in the cache after we get the result. 

In summary, the following steps are taken by both the client and the server 
to answer a query: 

1. The local cache is checked to see whether the information is available. If there 
are no suitable cache records corresponding to the location of the client, one 
should proceed to the third step. Otherwise, one should continue. 

2. If there is some related information, the most appropriate piece can be re- 
trieved from the cache and then this query is finished. 

3. The current location of the client and also the speed of it are transmitted 
to the server. The server will first locate this client in the VD index and 
find the nearest restaurant, and then compute the maximal circle around it 
within this region. 

4. Based on the region of the nearest restaurant and the speed of the client, we 
can identify the time when this client moves to another region. 

5. The result is returned to the client. 
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6. After the client gets the result from the server, a new cache record is inserted 
into the local cache. 

In Step 4, we use the first scheme (which assumes a constant speed and 
direction) . If the second scheme that can guarantee the valid duration described 
above is used, the maximum speed rather than the current speed of the client 
should be used. 



4 Simulation Model 

In order to evaluate the performance of the proposed method, a simple simulation 
model has been established using CSIM El- In the following section, the model 
setting is described first and then the performance evaluation result is presented. 



4.1 Model Setting 

In our simulation model, we simulate one cell, i..e., one base station covering a 
number of clients. The cell has a limited geographical coverage and the clients 
have a speed limit. When a client crosses the border of the cell, it is deleted 
and a new client lying within the cell is produced randomly. In other words, 
the number of clients within one cell is constant. The server answers queries 
according to the order of which the requests arrived. It takes ServiceTime to 
finish one query. The client sends out a new query only after the current one 
has been answered. It can send out the new query immediately or after some 
time. The longest duration is denoted as Max -Thinktime. ServNum means the 
number of the facilities available within this cell. Assuming that we are interested 
in the nearest restaurant, the facility here is the restaurants and ServNum is 
the number of restaurants within the cell. Tabled is a summary of the setting. 
In the following experiments, the setting is used unless otherwise stated. 

Given ServNum, the positions of the service objects are produced randomly, 
and the corresponding Voronoi diagrams are produced using an available pro- 
gram Triangle ca, which uses 0(n x log(n )) space to build a VD of n service 
points. 

Figure 0 shows a set of 20 nodes (i.e., ServNum = 20) produced randomly 
by the system. Figure 0 shows the corresponding Voronoi Diagram produced. 
Based on the VD produced, all the necessary data can be precomputed and 
stored in the memory for the simulation program to use. 



4.2 Simulation Results 

In our simulation, a large number of tests have been carried out. However, due to 
limitations of space, only some of the performance graphs are shown. For those 
which are not shown, we describe the observations whenever necessary. 
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Fig. 3. The Location of the restaurants 




Fig. 4. Corresponding Voronoi diagrams 
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Table 1. Simulation Model Settings 



Parameter 


Description 


Setting 


SimNum 


No. of query answered by the server in a simulation. 
When the server answers SimNum queries, 
the simulation is finished. 


5000- 

45000 


ClientNum 


No. of clients 


1000 


ServNum 


No. of the service objects 


10,20,30 


Upper_Y 


the maximal value of y_coordinate. 


1000 


Lower_Y 


the minimal value of y .coordinate. 


0 


Right JX 


the maximal value of x.coordinate. 


1000 


Left_X 


the minimal value of x.coordinate. 


0 


Max_Speed 


the maximal speed of the client. 


10 


ServiceTime 


the time server used to answer one query and also 
send the request. 


1.0 


Max.Thinktime 


the maximal time duration between one client receive 
the answer to the original query and send out next 
request. 


20.0 


CacheSize 


the size of the client’s cache. 


10 



Client Cache Types. The cache has a significant impact on the performance 
of this method. We tested the average waiting time for the clients using different 
cache schemes: no cache means that the client submits every query to the server 
and does not store any answers for later use; normal data cache means that 
the client just stores the answer to a query in the local cache and the cached 
answer can be reused when it submits the same query at the same location, and 
semantic cache means that the client stores not only the answer in the local 
cache but also the description of the valid area of this answer. 

The performance of these three cache schemes is given in Figure 0 and Fig- 
ure 0 with 10 and 50 service [tbp] objects, respectively. We also performed a 
simulation using 20 and 30 service objects and found that the results were sim- 
ilar. We observed that for clients using no cache, the performance is the worst, 
whereas the semantic cache produces the best result. This result is consistent 
with common sense - for no cache , every query must be submitted to the server; 
for a normal data cache , the chance that a client submits the same query at the 
same location is nearly zero, and so is the probability of reusing cached answers; 
and for a semantic cache , the client has a much higher probability of remaining 
in the area where the cached answer is valid and thus experiencing the best 
performance. 

In the simulation, the positions and the speeds of the clients are produced 
by CSIM randomly. At the beginning of the simulation, the whole system is 
not in a steady state since there are fewer requests. Thus, the average waiting 
time during the initial period is shorter than the average in the stable state. 
In Figures El and 0 the difference between the first and the second points is 



Average Waiting Time Average Waiting Time 
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Fig. 5. Average waiting time given 10 objects 




Fig. 6. Average waiting time given 50 objects 
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Number of Snapshots 

Number of Service Objects:20, Number of Queries:4.5*10 4 



Fig. 7. Average waiting time given 20 objects 



very obvious. For a cell with a different number of service objects, the warm-up 
duration is also different. Basically, the average waiting time of a semantic cache 
decreases with the increase of the simulation time as long as it has not arrived 
at the maximum cache hit rate. 

We also conducted the simulation using another benchmark program called 
GSTD , which generates sets of moving points or rectangular data following an ex- 
tensive set of distributions |Sj. Figured is the result. A large number of snapshots 
indicate that GSTD regenerates the set of moving clients at a higher frequency 
(i.e., a higher temporal resolution). 



Number of Service Objects. From the results of the above experiments, 
the conclusion is that the number of service objects can also affect the average 
waiting time. For clients with no cache, the impact is negligible, but for those 
with a semantic cache, the impact is obvious. This is because in one fixed-size 
cell, the more service objects there are, the smaller is the area in which cached 
data remain valid. This means a lower cache hit rate and a longer average waiting 
time. Figure 0 illustrates the negative impact of the number of objects on the 
cache hit rate. It can also be observed that the cache hit rate increases with the 
rise of the simulation time. 

In summary, when the number of service objects is limited, a semantic cache 
and a VD index are very efficient. When the number becomes very large, the 
semantic cache can only contain a very limited area and the cache hit rate is 
reduced. As such, the advantages of a semantic cache are not obvious. 
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Fig. 8. Average cache hit rate 



Cache Replacement Schemes. Since the size of a local cache is usually lim- 
ited, a cache replacement scheme is needed. For our method, three different cache 
replacement schemes have been devised. 

— Area: Since the cached data are represented by many circles, this scheme 
is based on the areas of the circles. The newly-added data always replaces 
cached data corresponding to the smallest area, with the aim of making the 
total area corresponding to the cached data the largest. 

— Dist: Based on the distance between any two centers of the cached data, this 
scheme replaces the cache item that has the shortest distance from the item 
to be inserted into the cache. This is based on the assumption that the client 
will soon cross the nearest voronoi cell. 

— ComA: This scheme replaces the circle that has the biggest common area 
with the newly-added one. The objective of this scheme is to make the rela- 
tively changed area less, this is a modification of the Area scheme. 

From the simulation results (Figure |2| and Figure fTTl) . Area and ComA have 
almost identical performance whereas Dist does not work as well as the other 
two. In the simulation result presented in the following subsections, we adopt 
Area as the cache replacement method. 

Frequency of Updating Speed. Using the benchmarking algorithm GSTD , 
we can control the number of snapshots. The larger the number, the more fre- 
quent the client changes its speed. Observing the simulation result presented 



Average Cache Hit Rate Average Cache Hit Rate 
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Fig. 9. Average cache hit rate given 30 objects 




Fig. 10. Average cache hit rate given 50 objects 
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Number of Snapshots 

Cache Replacement Scheme:Area, Number of Queries:3.5*1 0 4 



Fig. 11. Average cache hit rate given 20 objects 



in Figure o this also has a significant impact on the cache hit rate: the more 
frequent the change, the lower the cache hit rate. 



Range of the Think Time. The Max-Thinktime indicates the frequency that 
a client submits queries. The higher the frequency, the more queries need to 
be answered, so the longer the waiting time is. If nearly all the clients submit 
a query after a long duration, the number of queries that one server needs to 
answer will be less, so the average waiting time will be shorter. This can be 
observed in Figure [ED 

5 Conclusion 

In this paper, we presented an indexing and semantic cache method for location- 
dependent queries based on the Voronoi Diagrams. Various cache replacement 
strategies were proposed and evaluated. We conducted a simulation to evaluate 
the performance of the proposed methods under different parameter settings. 
We concluded that the semantic cache method performs much better than the 
normal data cache method. 

In this paper, we only considered a single cell where the clients and objects are 
produced randomly. In future research, we will consider the handoff problem and 
cache invalidation methods in a multiple cell environment. Furthermore, since 
the positions of the clients and data objects as well as the speed of the clients are 
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produced randomly, we will investigate the performance of the proposed schemes 
using different distributions. 
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Abstract. With the emerging availability of small and portable devices that are 
able to determine their position and to communicate wirelessly, mobile and spa- 
tially aware applications become feasible. These applications rely on informa- 
tion that is bound to locations. In this paper we present NEXUS, a platform for 
such applications, which is open for both new applications and new information 
providers, similar to the World Wide Web. Distributed servers provide location- 
based information, which is federated to an integrated view for the applications. 
To achieve this goal, we present the concept of the Augmented World Model, 
which is a common data model for location-based information. We give an 
example to show how applications can use this platform. 



1 Introduction 

If we have a look at our everyday life we will notice that location plays a very impor- 
tant role in the delivery of information. Signs close to doors describe what is behind 
that door, signs on the streets control the traffic flow, or even labels in a supermarket 
tell us what’s wrapped in there and how much it costs. Imagine what would happen if 
this intuitive way of accessing information would be extended to the digital world. 

The following scenario illustrates this vision. Someone travels to a foreign city to 
visit a famous museum. He carries a mobile device to assist him during the journey. 
This could be a personal digital assistant (PDA), that is able to communicate wirelessly 
and to determine its position. After the arrival at the train station his mobile device 
navigates him to the museum using public transport and a short walk. During the walk 

C.S. Jensen et al. (Eds.): SSTD 2001, LNCS 2121, pp. 117-135, 2001. 
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an ancient building attracts his attention and his mobile device tells him that it was the 
residence of a former king some centuries ago. He decides to investigate this further 
after the visit to the museum and directs his mobile device to remind him when he 
passes the building again. 

On approaching the museum, the mobile device presents him a virtual ticket booth 
where he can buy an admission ticket for the museum without having to wait in a 
queue. At a special gate his virtual ticket is approved and he can enter the museum 
without delay. At the same time his mobile device switches to a map of the museum 
and suggests a tour to the most important exhibits. It automatically displays detailed 
information on each exhibit where he is standing next to. 

After leaving the museum the mobile device reminds him of his plans to investigate 
the ancient building. But he decides to go to the city center where he wants to buy 
some presents. The mobile device of course helps him in getting there. As he likes 
golfing very much it informs him about a special offer of a nearby golf department in a 
big department store while he is strolling around in the city center. He decides to take a 
closer look and his mobile device helps him not only in getting to the store but also in 
finding the golf department inside the store. 

The above scenario illustrates the potential of spatially aware applications and how 
they can help us in everyday life. Some of the features described above are already 
partly available today like car navigation [15] and museum guides, but only as isolated 
applications. The NEXUS project aims at integrating all these isolated applications into 
one open platform. There, different applications can use both common and applica- 
tion-specific data and they can even communicate and cooperate with each other. We 
envision NEXUS as being a middleware that brings providers and consumers of loca- 
tion-based information together. In [7] an overview of the NEXUS platform was given 
accompanied by a very high level description of the research topics being addressed in 
our project. In this paper we present the data model and the architecture of the infor- 
mation management component in detail. 

The remaining part of this paper is organized as follows. Section 2 derives the 
requirements for the NEXUS platform from the example in the introduction and gives 
an overview of the architecture. In Section 3, the Augmented World Model, which 
forms the basis for federating data from different sources in our approach, is described. 
Spatial Model Servers, which are the components of the platform that store static 
objects, are presented in Section 4. The structure of the Federation Layer is outlined 
there as well. In Section 5 the data and control flow between the components is illus- 
trated on the basis of a sample application. Section 6 relates our approach to other con- 
cepts and projects, and Section 7 concludes the paper and gives an outlook on future 
work. 
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WWW scenario neXus scenario 



Fig. 1. Architecture of the WWW compared to the NEXUS system 



2 Overview of the NEXUS Architecture 

2.1 Requirements 

As we can see in the preceding scenario, there are a number of applications that use 
spatial data or services that are linked to locations. We call them mobile, spatially 
aware applications or neXus applications. To support such applications we need an 
open platform - the neXus system. There are two major features that this platform 
must have: First, it will be used by many different applications. There may even be 
types of applications we do not think of at the moment. We call this model the Aug- 
mented World Model. The Augmented World Model is the base for the neXus sys- 
tem’s extensibility and flexibility and it forms the interface to the applications. Second, 
everybody shall be able to contribute information to the model. In addition, existing 
data sources like the WWW shall be integrated. This may obviously lead to a great het- 
erogeneity in the data. In neXus, a federation approach is used to handle that hetero- 
geneity. 

The second requirement is quite similar to the mode of operation of the World 
Wide Web (see Figure 1). In principle, the web allows anybody to contribute data by 
setting up a web server; analogous to this, any data server which implements the 
neXus Service Interface may contribute its data to the neXus federation. Beside the 
Federation Layer, the major difference is the Augmented World Model, which allows 
the neXus system to present its data in a uniform view. This data model will be 
described in Section 3. 
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2.2 The neXus Platform 

To meet the requirements mentioned above, neXus uses distributed services to store 
data. The neXus architecture is built up in three tiers: neXus applications, nodes and 
services. If we compare this to the World Wide Web (see Figure l), neXus services 
play the role of web servers, while NEXUS applications are similar to web applications 
like browsers or applets. The middle tier, the neXus nodes, does not exist in the 
WWW architecture: it is an open federation of the underlying data providers, based on 
the location as the main integration measure. So the neXus applications only have to 
contact their closest neXus node to get an integrated view on all the location-based 
data currently available. 



App. 1 
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Fig. 2. Overview of the architecture 

As we can see in Figure 2, neXus nodes mediate between neXus applications and 
neXus services. They are responsible for distributing queries to the different services 
and for composing the results, thus providing an integrated and consistent view for the 
applications, the Augmented World Model (AWM). This view is an object-oriented 
information model of the real world, which is augmented by additional information 
(e.g. web sites). Spatial Model Servers are used to store static objects like buildings, 
streets or Virtual Information Towers, which are used to assign external data (e.g. 
WWW data) to locations. The Spatial Model Servers are registered at the Area Service 
Register, which relates geographical areas to associated Spatial Model Servers. This 
enables an efficient selection of the minimal subset of servers required to process a cer- 
tain query. As we will see in Section 4, the Spatial Model Servers are not suitable for 
storing mobile objects. Instead, a Location Sendee [9] is used to manage them. 
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3 The Augmented World Model 

The Augmented World Model is an object-oriented information model, for mobile spa- 
tially aware applications. It’s basic idea is that the user of such an application lives in a 
world of real objects (buildings, streets, cars,...) which is augmented by virtual objects- 
like Virtual Information Towers [11]. These virtual objects are metaphors for external 
information (see Figure 3). 



i 




Mobile user 
in real world 
(Buildings, users...) 



Augmented 
World Model 

(model of buildings, users, 
locations...) 



Location -based external 
information (WWW data, ...) 



Fig. 3. Real and Augmented World 



3.1 Conceptual Elements 

In order to determine which objects should be part of the Augmented World Model, we 
have developed use cases which are typical for mobile, spatially aware applications. In 
this section, we describe the elements of the model and relate them to some of the 
underlying use cases. 

Real World Objects. It is possible to build location-aware applications without mod- 
eling real world objects, just by assigning information to geographical coordinates or 
vice versa. But NEXUS wants to support not only location-aware, but spatially aware 
applications, which are not only aware of where they are, but also of what is in their 
environment. One use case for this is "getting information by pointing at objects": the 
user points with an appropriate device at an object (for example a library) and gets 
information about it (e.g. the homepage of that library). Other use cases require rela- 
tions between objects of the real world and information (e.g. "sticking information on 
an object (Virtual Postlt)", see below). To support these use cases, a detailed model of 
the real world is necessary. Therefore, NEXUS defines real world objects in different 
levels of detail, like rooms, buildings, streets, areas, or cities. 
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In the Augmented World, there are also mobile objects. Examples for mobile 
objects are the user herself, other users or vehicles like busses and trains. A use case 
for this is "getting the current location, direction and speed of a certain mobile object", 
for example of the next bus. 

Virtual Objects. As said before, virtual objects are used as metaphors for external 
information. Examples are Virtual Information Towers, which are used to link external 
information in the Augmented world and are independent of any other objects, and 
Virtual Postlts, which contain information that sticks on another object (which can be 
real or virtual). 

To support navigation, which is probably today’s most common use case, the AWM 
contains special navigation objects (edges and nodes), which are associated to real 
world objects like streets and crossings. 

3.2 Realization 

As mentioned in Section 2, the NEXUS architecture is an open platform similar to the 
World Wide Web, where the data is widely distributed and even search engines cover 
only a small part of the data. So how can we provide this consistent and integrated data 
model? 

The Standard Class Schema. First of all, we have defined a Standard Class Schema. 
This is a set of object classes and their attributes, which are considered fundamental for 
most neXus applications, as described in Section 3.1. All neXus services have to be 
compliant to this definition. In consequence, all applications can query every neXus 
node and ask for the predefined attributes and classes, for example the menu of a res- 
taurant (see Figure 4). 




Fig. 4. Standard and Extended Class Schema 
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The Extended Class Schema. To keep the schema customizable, it is intended to 
adapt the Standard Class Schema. Future NEXUS applications and services will proba- 
bly need further and more detailed objects and therefore might want to extend the 
Standard Class Schema. A simple example to illustrate customization is a new NEXUS 
service that provides information about Italian Restaurants. However there is no class 
for Italian Restaurants in the Standard Class Schema. So the NEXUS service provider 
defines a new class, which inherits from the most similar class in the Standard Class 
Schema (in this case this would be the class "restaurant"), and adds the attributes 
which are missing to the base class. We call the new set of object classes the Extended 
Class Schema. It is a superset of the Standard Class Schema. As seen in Figure 4, there 
can be more than one Extended Class Schema. They all share the object classes of the 
Standard Class Schema. 

Optional Attributes. When defining the attributes for the classes in the Standard 
Class Schema, the defined set of attributes can be either aimed at simplicity or at com- 
pleteness of the model. If we define only the mandatory attributes and leave out all the 
other possible useful attributes, the model would have a poor semantic. For every real 
problem, an Extended Class Schema would be necessary and the idea behind the Stan- 
dard Class Schema would be ill-conceived. On the other hand, if we define all possible 
attributes, every NEXUS service has to provide data for all of these attributes. The effort 
to provide a NEXUS service would be very high which contradicts the idea web-like 
easy integration of new services. To solve this problem, we declared most of the 
attributes optional. A NEXUS service does not have to provide all of the data, but if it 
wants to, the name and the type of the attributes are defined. This increases the interop- 
erability of the data provided by different services while still retaining an easy setup of 
new services. 

Data Exchange. As it has already been described in Section 2, applications query the 
closest available neXus node about location-based information in their surroundings. 
The neXus node distributes the query to appropriate Spatial Model Servers, integrates 
the results and returns the proper neXus objects. In this data flow, the objects of the 
Augmented World Model have to be serialized. We have defined an XML language for 
this purpose, which we call Augmented World Modeling Language (AWML). It is 
comparable to HTML: both data formats define basic semantics using a markup lan- 
guage. In HTML, the underlying model is a model of hypertext documents. It defines 
titles, headings, images, tables, hyper links, and so on. Similarly, AWML is based on 
the Augmented World Model. Therefore it defines tags for neXus objects, their 
attributes and data types, which all have semantics associated with them. 

There is also a query language called AWQL (Augmented World Query Lan- 
guage), that supports object retrieval within the Augmented World Model. In contrast 
to other XML query languages like QUILT [4], AWQL is not used to extract data out 
of complex XML documents; instead, it is an XML representation of simple spatial 
queries to a object oriented model, which can easily be transformed in SQL99 (Spatial 
Part). However our query language is far simpler than SQL which facilitates the task of 
federating different data sources a lot. 
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We have chosen to use XML technology for the data exchange because XML has 
some striking advantages. First of all, it allows optional attributes. This is important 
not only because of the optional attributes in the Standard Class Schema, but also for 
minimizing the data flow. Applications can restrict their queries to the attributes they 
are interested in. Second, it becomes more and more a common standard for data 
exchange. For example the OpenGIS consortium [16] defines an XML representation 
of the Simple Feature Specification named Geography Markup Language (GML) [17]. 
We use this existing standard for representing the geographical objects and attributes 
of the Augmented World. Another advantage of the popularity of XML is the availabil- 
ity of tools and parsers for many platforms. 



4 Components of the Platform 

In the following section the main components of the NEXUS platform (as depicted in 
Figure 2) are described. They deal with the management of the Augmented World 
Model. 

4.1 Spatial Model Server 

Spatial Model Servers store static objects. The idea is to allow anybody to set up her 
own Spatial Model Server containing spatial information she wants to publish, just as 
she may set up her own web server. Like web servers. Spatial Model Servers must be 
connected to the Internet and must provide a certain (here: NEXus-compliant) API, 
which in our case is the neXus Service Interface. 

Augmented Areas. Basically, the union of the content of all Spatial Model Servers 
forms the static part of the Augmented World Model. To allow this union (or parts of 
it) to be exploited in a valid and efficient manner, we introduce the concept of the Aug- 
mented Area within the NEXUS Federation Layer. An Augmented Area represents spe- 
cific meta-data describing the content of its associated Spatial Model Server and can 
best be described by the following features: 

• Each Augmented Area covers a certain area or terrain. This area can be arbitrarily 
chosen, e.g. it may be the area of a single building, a town or a whole country, but it 
is rarely changed. All objects belonging to an Augmented Area lie completely 
inside its boundary. Different Augmented Areas may overlap each other. Shapes 
and positions of the Augmented Areas are stored at the Area Service Register (see 
Section ). 

• An Augmented Area may be restricted to contain only certain types of objects, e.g. 
only Virtual Information Towers or only Italian restaurants. This information is 
also known to the Area Service Register. 

• Each Augmented Area is consistent, i.e. each object has at most one representation 
per Augmented Area. As Augmented Areas may overlap, one object may have sev- 
eral representations within several Augmented Areas. Augmented Areas may coop- 
erate, i.e. provide information about the existence of multiple representations of 
one object. In this case, the Federation Layer can easily select one representation 
suitable to answer the query, based e.g. on the desired level of detail. 
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• All objects belonging to an Augmented Area are stored at the same Spatial Model 
Server (though one Spatial Model Server may store objects of different Augmented 
Areas). Knowing the name of this server allows the Federation Layer to retrieve all 
objects of an Augmented Area. 

Figure 5 illustrates the relationship between Augmented Areas (AA), Spatial 
Model Servers (SpaSe) and the Area Service Register (ASR). AA 2 contains represen- 
tations of three buildings and two streets stored at SpaSe A. AA 3 contains the interior 
(rooms) of one of the buildings; AA 1 a different representation for one building and 
some VITs. 




Area Service Register (ASR). The Area Service Register is a repository that contains 
some information about Augmented Areas: The covered area, the type of objects 
belonging to this Augmented Area and the name of the Spatial Model Server storing 
these objects (see Figure 5). This information is used by the Federation Layer to iden- 
tify the Spatial Model Servers needed to answer a query. Section 4.3 discusses the 
interaction between the Federation Layer, the Area Service Register and Spatial Model 
Servers in detail. 

Building a Spatial Model Server. The APIs of Spatial Model Servers and neXus 
nodes contain a special query language, which we call the Augmented World Query 
Language (AWQL). A Spatial Model Server may use an arbitrary system to actually 
store data as long as it provides a wrapper to AWML. Generally, either typical Geo- 
graphical Information Systems (GIS) or spatially enabled DBMS could be used for the 
implementation. Since it is intended to store huge amounts of data in a distributed 
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environment, spatial databases seem to be more suitable for our purposes. Further- 
more, mediating between AWQL and SQL is easier to be done than translating AWQL 
into the predominantly proprietary interfaces of common GIS. Another reason for pre- 
ferring DBMS is that some Geo-Information Systems do not appropriately support 
mechanisms to communicate within a network in a standardized manner. 

The DBMS that are preferred as Spatial Model Servers within NEXUS should be 
object-relational, i.e. they should provide data types for the storage of spatial objects 
and appropriate methods on them, according to the implementation specifications of 
the OpenGIS consortium [16]. 

Different vendors like Oracle and IBM offer this kind of spatial databases. But the 
products do not yet cover all requirements of NEXUS. For example, the topological 
relations of spatial objects can not be reflected within spatially enabled DBMS. But the 
NEXUS infrastructure has to include algorithms for network analyses like shortest path 
queries and therefore needs to access topological information. Another important crite- 
rion that needs to be fulfilled by the system and which is not yet integrated in 
ORDBMS concerns the storage and handling of multiple representations of spatial 
objects. Also, the integration of 3D geometries into spatial databases is an unsolved 
problem. While developing the NEXUS infrastructure, the questions mentioned above 
will be challenges for further research efforts. 

4.2 Location Service 

Spatial Model Servers are not able to store mobile objects. They lack the capability to 
perform hand-overs of moving objects. In addition to that, objects moving into areas 
not covered by an Augmented Area would be lost. We also would have to assure that 
Spatial Model Servers could cope with frequent updates of positions. 

To circumvent these problems we decided for the current architecture to use a sep- 
arate Location Service as described in [9]. In doing so, this service can be indepen- 
dently optimized with respect to data structures and algorithms for efficiently 
managing the current locations of a large number of mobile objects. 

4.3 The Federation Layer 

The Federation Layer provides a uniform view on the data offered by the neXus ser- 
vices to the neXus applications. It is delimited by two interfaces. Applications post 
their queries and receive the results through the neXus Application Interface, while all 
services like Spatial Model Servers and the Location Service are connected to the Fed- 
eration Layer through the neXus Service Interface. As the Area Service Register is an 
internal component of the Federation Layer it is also accessed through an internal 
interface. Figure 6 shows the basic structure of the Federation Layer. 

The Federation Layer processes any incoming query in the following way. First it 
extracts all spatial predicates out of the query to determine the target region to which 
the query is directed to. Then it asks the Area Service Register for all Spatial Model 
Servers that store Augmented Areas that overlap with the target region. The Area Ser- 
vice Register also supplies the class descriptions of the objects stored on each Spatial 
Model Server. The Federation Layer then decomposes the query into appropriate sub- 
queries for each of the involved Spatial Model Servers using these class descriptions. If 
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mobile objects are involved in the query a corresponding subquery is sent to the Loca- 
tion Service as well. External data like web pages or legacy databases can be integrated 
into the Augmented World Model by building a Spatial Model Server that associates 
the external data with objects in the model. 



Proprietary Interfaces 



neXus Application Interface 




Fig. 6. Structure of the Federation Layer 



Thereafter the partial results of the subqueries are collected and integrated into a 
single consistent result. For providing a uniform view on the data, additional model 
transformation functions, which will be explained below in Section 4.4, have to be 
applied. Finally the integrated result is delivered to the application. 

There is a Federation Layer on every NEXUS node and its capabilities are identical 
on each node. An application can thus send its query to the closest node and receives 
the same result as if it had sent it to any other node. In addition to that, a NEXUS node 
can provide value-added services like a map service that renders rectangular maps or a 
navigation service that calculates a path to a given destination. An application has to 
access these services through their own proprietary interfaces. The services themselves 
use the NEXUS application interface to get the data needed to provide their value-add- 
on. The NEXUS platform allows the integration of further third party services into a 
neXus node. 
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4.4 Model Transformation Functions 

At the level of the Federation Layer, some functions have to be provided that allow the 
handling of spatially related queries in a distributed system and the integration of dif- 
ferent partial models into a single global model. At present these model transformation 
functions can be divided into three main tasks: 

• Homogenization 

• Generalization 

• Aggregation. 

Since all kinds of spatial data can be integrated into the NEXUS system, it is a cru- 
cial prerequisite that the data sets can be used together, i.e. their interoperability has to 
be guaranteed [8]. For this reason, identical objects within different data sets must be 
matched using so-called homogenization or conflation procedures [22]. Very often, 
this problem is difficult to cope with, because the assignment of an object in one data 
set to its correspondent representation in another data set is ambiguous. 




Problem 



Fig. 7. Matching nodes and edges of different data sets (after [22]) 

In Figure 7, data set 1 and 2 represent two topological graphs of some roads. The 
next picture shows an overlay of them. The rightmost picture visualizes the problem of 
matching nodes and edges from different data sets. A matching of the bold edge of data 
set 2 to an edge of data set 1 is ambiguous if only geometry is considered. Thus, also 
topological relations and object attributes have to be analyzed in order to detect corre- 
sponding features. Conflation operations are sometimes limited since appropriate indi- 
cators for the matching process might not be available. 

In terms of interoperability, generalization [1] and aggregation [19] techniques play 
an important role, too. Within the NEXUS context, aggregation defines the merging pro- 
cess of several smaller objects into one object of a higher order, whereas generalization 
is understood to be the mechanism that reduces the details of spatial objects, predomi- 
nantly with respect to geometry [14]. 

Generalization and aggregation functions are necessary within the infrastructure, 
because some data sets might differ concerning the level of detail they are originally 
stored at individual Spatial Model Servers. NEXUS therefore has to find the adequate 
abstraction level such that data of different resolution can be processed together. 
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Since the platform claims to be an open system, it also has to provide a data basis 
on which all kinds of applications can work on. And as different applications need dif- 
ferent resolutions of data, NEXUS has to manage data in multiple representations - 
from coarse grained to highly detailed levels. Thus, generalization and aggregation are 
also needed in case that only detailed data is available for a certain part of the Aug- 
mented World, but an application demands for spatial information in rather low resolu- 
tion (see Figure 8). 




Fig. 8. Fully ground plan map (left) and aggregated view (right) of the same scene. To get an 
overview, the representation on the right is more appropriate. 



5 Example: The VIT Application 

In this section, we describe how a simple neXus application can work with the neXus 
platform. The sample application is described in detail in [11], so we give only a brief 
summary of its functionality. 

5.1 Overview 

A Virtual Information Tower (VIT) is a rough electronic equivalent to real-world 
advertising columns. It is positioned at a fixed geographical location and has a certain 
surrounding area which is called visibility area. The VIT application runs on a portable 
device, equipped with a location sensor (e.g. GPS). It displays a map of the area in 
which the user is currently located, and augments it with the VITs that are visible at 
her location. As the user moves on, the number of VITs is changing: when she leaves 
the visibility area of a VIT, it disappears from the map, and when she enters other visi- 
bility areas, new VITs will pop up. If she clicks on a VIT, the assigned information 
(e.g. a web page) will be displayed. 

With the neXus platform, the VITs can be stored on Spatial Model Servers, which 
can also generate the underlying map. The NEXUS nodes do the work of the VIT man- 
ager. In the following, we describe the data and control flow between the VIT applica- 
tion, a neXus node and some Spatial Model Servers. 
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5.2 Data and Control Flow 

1 . Query closest available node. The application queries the closest available neXus 
node and asks for the graphical representation of all buildings, streets and VITs 
within a certain area around the current position (let’s say, 500 meters). 

2. Find Spatial Model Servers. The NEXUS node selects the appropriate Spatial Model 
Servers. To perform this task, it asks the Area Service Register to return the Spatial 
Model Servers containing Augmented Areas that overlap the area specified by the 
application’s query, and objects of the requested types (buildings, streets or VITs). 
In the example, the result set contains two Spatial Model Servers: SpaSe A, which 
contains buildings and streets, and SpaSe B, which contains VITs and detailed 
information about important buildings. 




3. Query closest available node. 

4. Find Spatial Model Servers. 

5. Query Spatial Model Servers. 

6. Get Results from Spatial Model Servers 

7. Integrate Results 

8. Get Result from the node 



Fig. 9. Data and control flow between VIT application and the platform 

9. Query Spatial Model Servers. The NEXUS node sends the application query to 
SpaSe A and B. If the query would ask for mobile objects, it would query the Loca- 
tion Service for this information (see Section 4.2). The Spatial Model Servers 
receive the query in AWQL. They must now translate it to their local interface, e.g. 
SQL 99 and retrieve the requested objects and attributes. Therefore they have to 
perform geometric functions - in the current example, they have to overlap the 
position of the objects with the requested area. Finally, the result set has to be seri- 
alized in AWML and sent back to the NEXUS node. All mappings between the 
NEXUS service interface and local, i.e, proprietary interfaces are accomplished - if 
necessary - by wrapper technology in the Spatial Model Servers. 

10 .Get Results from Spatial Model Senders. The NEXUS node receives the resulting 
AWML documents from the Spatial Model Servers. 

11. Integrate Results. Now, the NEXUS node must integrate the received documents. It 
homogenizes different levels of detail and chooses the appropriate representation, 
if more than one server delivers information about the same object. After that, it 
combines the results into one single AWML document. 
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12. Get Result from the Node. The V1T application gets the final result from the NEXUS 
node. It can now use the information to visualize a map with buildings, streets and 
VITs. If the user clicks on VIT X, the whole process starts again, this time with the 
query: "give me the detailed information that sticks on VIT X". 



6 Related Work 

6.1 Geographic Information Systems (GIS) 

The neXus infrastructure is not aimed at supporting GIS applications in the common 
sense. It rather provides certain functions that are related to Geographical Information 
Systems, but lacks some sophisticated techniques for spatial analysis and therefore 
cannot be considered as a complete GIS. Missing functions have to be implemented by 
the applications, if they are required. However, the neXus platform could also be ext- 
ended in case that new GIS-related requirements of spatially aware applications are 
evolving. 

There are a number of differences comparing typical GIS applications and the 
neXus platform. Usually, applications of Geographic Information Systems work on 
data sets that are appropriately modelled for their individual purposes. neXus, though, 
is designed as an open platform for spatially aware applications. Therefore, a generic 
data model that can be extended to meet the specific needs of different applications 
forms the basis of the neXus system. Anyone being interested in participating in 
neXus has to obey the rules of the underlying data model and has to adjust its data 
accordingly. 

Moreover, typical GIS are more or less monolithic systems and their applications 
are running on a single computer, dealing only with geographical data in a proprietary 
format. Since neXus is designed to be a highly distributed system, the interoperability 
of the different data sets being located at different servers, is a vital prerequisite. Thus, 
appropriate mechanisms like homogenization, generalization and aggregation have to 
be integrated into the platform (see Section 4.3). These functions are usually not pro- 
vided by today’s GIS. Besides that, advanced techniques to handle geographical que- 
ries involving different Spatial Model Servers have to be offered by the neXus 
infrastructure. This item addresses problems that have to be taken into account at the 
level of the federation, dealing with data localization, development of query strategies 
(parallelization) and cost estimation [23]. 

6.2 Federated Information Systems 

The neXus platform can be classified as a federated information system (FIS). It pro- 
vides a federated access to all data sources that are connected to the platform, i.e. that 
are registered at the Area Service Register. The Augmented World Model serves as a 
federated global schema through which this federated access is accomplished. As the 
result of the analysis of a number of characteristic use cases, the model is constructed 
top-down satisfying a global information need. The platform is more than just a feder- 
ated database system [20] as it is not integrating a fixed and a priori known set of data 
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sources into a global schema. Thus no bottom-up schema integration techniques like 
proposed in [3], [12] are used. Servers offering information can be integrated into and 
removed from the platform at any time. 

According to [2], a top-down design method implies a loose coupling between the 
components of a FIS, especially if the components are constantly changing. Moreover 
a loose coupling also means data loss between components because of conversion 
steps like aggregation which renders a write access to the data impossible. In our 
approach, however, the Standard Class Schema of the Augmented World Model pro- 
vides a global schema to which all component systems participating in the federation 
have to conform to. Each object in the model has a unique identifier which, in the gen- 
eral case, allows to update its attributes, if access restrictions permit this. Updates are 
necessary for example to enable sensors to propagate the current state of their environ- 
ment into the model, like the position of the user, or to let the user control devices in 
the real world, like a light switch. 

New object instances and even new object types can be inserted into the model as 
long as they are derived either directly or indirectly from a class in the Standard Class 
Schema. The Federation Layer determines a suitable server to store a new object by 
looking at the type and location of the object. An example for an object to be inserted 
into the model is a virtual Postlt. 

Usually an application does not need to know on which server a certain object is 
stored, the NEXUS platform handles this transparently. Objects from different servers 
can be retrieved in a single query. The NEXUS platform thus provides not just an inte- 
grated access to all data sources connected to it, but a federated one where location 
plays the key role. 

Compared to the TSIMMIS project [5], [18], [6] the Augmented World Model can 
be seen as an integrating view which features object fusions through homogenization 
and abstractions through aggregation and generalization. The Federation Layer can be 
compared to a very thick mediator that forwards incoming queries to all component 
systems that are involved in the processing of a certain query. Wrappers can be used to 
provide an AWQL/AWML interface for off-the-shelf spatial databases or other data 
sources like the web or legacy databases. 

6.3 Related Projects 

The Cyberguide project [10] mainly focused on developing a virtual tour guide appli- 
cation that provides location-based information and navigation to a user. The Cyber- 
guide system is however not maintaining a detailed model of the real world and it is 
not aimed at integrating with existing information services. 

The DeepMap project [13] is developing a virtual tourist guide as well and is very 
much concentrated on the tourist as a potential user of the system. It is focused on pro- 
viding location-based information, but it is not spatially aware, i.e. it has no detailed 
model of the user’s surrounding. It is aimed at outdoor use and an integration with 
indoor information systems is not planned. 

The WorldBoard project [21] links existing web-pages to any location on earth. 
The WorldBoard system provides only location-based information, but it does not deal 
with navigation. It does not employ a model of the world either. 
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7 Conclusion and Future Work 

In this paper we have presented the architecture of the NEXUS platform. It provides an 
open global infrastructure for spatially aware applications. The spatial awareness of 
the NEXUS applications makes them easy to use also for inexperienced users. New 
information services can be plugged into the platform as easy as publishing a web 
page. 

We have introduced the Augmented World Model as a semantically rich object-ori- 
ented model of the real world. Its Standard Class Schema defines basic object classes 
for static, mobile and virtual objects which have optional attributes to simplify the 
introduction of new services while still defining a common format for detailed infor- 
mation. Object classes can be further extended by applications as needed. 

By means of a detailed explanation of the main NEXUS components (Spatial Model 
Server, Augmented Area, Area Service Register) and their interaction patterns, we 
showed that the NEXUS platform is quite different in comparison to available geo- 
graphic information systems (GIS) on one side, but also different to federated informa- 
tion systems on the other side. It can be characterized as an open platform to support 
spatially aware applications, just like WWW servers support web-based applications. 
Instead of the neutral HTML we exploit a semantically enriched AWML/AWQL, and 
instead of leaving the integration of different web services up to the application, we 
support a mediation of the services by means of our Augmented World Model and the 
available NEXUS interfaces (NEXUS Application Interface and NEXUS Service Inter- 
face). This mediation and federation approach can be contrasted with what is called 
Enterprise Application Integration, but with respect to NEXUS the focus is on spatial 
awareness issues. In the NEXUS platform, the middle tier allows to support value-added 
services like in our case a map service or a navigation service that exploit the inte- 
grated data view provided by the Federation Layer. 

Currently the NEXUS project consists of four departments at three institutes at the 
University of Stuttgart. We are building a small-scale prototype of the platform to dem- 
onstrate the basic features of the NEXUS system. Future work will focus on one side on 
refining the internal components and their interaction in order to enhance performance. 
On the other side, there is more work to do in developing more comprehensive applica- 
tions. 
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Abstract. The development of spatiotemporal database systems is primarily 
motivated by applications tracking and presenting mobile objects. Another 
important trend is the visualization and processing of spatial data using XML- 
based representations. Such representations will be required by Internet appli- 
cations as well as by location-based mobile applications. In this paper, an 
architecture for supporting queries on XML-represented moving objects is 
presented. An important requirement of applications using such an architecture 
is to be kept informed about new, relocated, or removed objects fulfilling a 
given query condition. Consequently, the spatiotemporal database system must 
trigger its clients by transmitting the required information about the relevant 
updates. Such queries are called continuous queries. For processing continuous 
queries, we have to reduce the volume and frequency of transmissions to the 
clients. In order to achieve this objective, parameters are defined, which model 
technical restrictions as well as the interest of a client in a distinct update opera- 
tion. However, delaying or even not transmitting update operations to a client 
may decrease the quality of the query result. Therefore, measures for the quality 
of a query result are required. 



1 Introduction 

The research activities in the field of spatial database systems have been shifted to the 
investigation of spatiotemporal database systems. Projects like "Chorochronos" [21] 
and a sequence of workshops (e.g. [10], [24], [25]) are indicators of this trend. The 
main topics of interest are the structure of time and space, data models and query 
languages, the efficient processing of queries by using adequate storage structures and 
database architectures, and the design of graphical user interfaces for spatiotemporal 
data. The investigation of spatiotemporal database systems is especially motivated by 
applications, which require to track and to visualize moving objects. Many of these 
applications originate from the field of traffic telematics, which combines techniques 
from the areas of telecommunication and computer science in order to establish traffic 
information and assistance services. Such applications ([33], [1]) require the 
management of moving objects, e.g., of vehicles of a fleet. 
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Another trend is the visualization and processing of spatial data via the Internet 
using XML. Current geographical information systems (GIS) or systems just 
announced (e.g. [9], [23]) (will) support the XML-based Geography Markup 
Language (GML) [15]. The next step will be the support of mobile applications. Now, 
the (intermediate) standard GPRS (General Radio Packet Service) is being launched 
in Europe. By introducing the new mobile telephone standard UMTS (Universal 
Radio Packet Service), this trend will be dramatically enforced. One common proper- 
ty of mobile applications is that they refer to locations and, therefore, require the 
transmission of spatial or spatiotemporal data. The appearance of mobile applications 
has also an impact on the devices used for presenting data: In addition to personal 
computers and workstations, personal digital assistants (PDAs) or mobile telephones 
are used as Internet clients. However, the computing power of such devices is rather 
restricted compared to traditional computers. In addition, the speed and throughput of 
wireless networks are limited and are subject to large variations. 

The combination of these trends requires a suitable architecture for the XML-based 
representation and visualization of moving spatial objects. This architecture consists 
of the chain beginning at the client, which is responsible for the visualization of the 
spatial objects and for the user interaction, over suitable middleware up to the spatio- 
temporal database system. A special task of such an architecture is to support clients 
differing in the technology used for presenting and transmitting data in order to 
support traditional Internet applications as well as location-based mobile applications. 

Applications tracking and presenting mobile objects require to be kept informed 
about new, relocated, or removed objects fulfilling a given query condition. Con- 
sequently, the spatiotemporal database system must trigger its clients by transmitting 
the necessary information about such update operations. The query, which causes this 
process, is called continuous query [28]. The result set of a continuous query is not 
only influenced by the update operations occurring in the database and by the given 
query condition, but - especially for mobile applications - also by the capability of 
the client to process the result. Typically, technical restrictions limit the computing 
power of the clients, the maximum resolution of spatial distances the client is able to 
distinguish, and the maximum speed and throughput of the connection between the 
database system and the client. Therefore, it is not advisable to transmit the complete 
result of a continuous query. Instead, a reasonable filtering must be performed. How- 
ever, delaying or even not transmitting update operations to a client may decrease the 
quality of the query result. Therefore, indicators for the quality of the result of a 
continuous query are required. 

The paper starts with a description of the architecture for querying XML-repre- 
sented moving objects. The components of the architecture are discussed; a special 
attention is given to the integration of representations like GML and SVG (Scalable 
Vector Graphics). In Section 3, a short definition of the data model describing the 
spatiotemporal objects is given. After a classification of update operations occurring 
in a spatiotemporal database system, a formal definition of the result set of the 
continuous query is presented in Section 3.4. In order to reduce the volume and fre- 
quency of transmissions to the clients, corresponding parameters are defined and inte- 
grated into the architecture in Section 4. Measures for the quality of the results after 
applying such restrictions are presented in section 4.4. Section 5 gives an overview of 
related work. Finally, the paper concludes with a summary and an outlook on future 
work. 
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2 Architecture for Querying XML-Represented Moving Objects 

In this section, an architecture for querying XML-represented moving spatial objects 
is presented. The diagram in figure 1 gives an overview of the basic components and 
the dataflow within this architecture. The single components are described in the rest 
of this section. 





Fig. 1 . An architecture for supporting an XML-based representation and visualization 
of moving spatial objects. 



2.1 The Client 

The main task of a client is the presentation of the data and the interaction with the 
user. From the point of view of a database system, the clients formulate the queries 
and take the query results. Another task of the client is the communication with the 
Internet server. 

In the following, the visualization of moving spatial objects, typical queries, and 
the communication of the clients with the Internet server are discussed. 

2.1.1 The Visualization 

By using TV consoles, PDAs, or mobile telephones in addition to PCs and 
workstations as Internet clients, the properties of clients differ very much. This is 
caused by using different Internet browsers, operating systems, and hardware. These 
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differences require using different techniques for presenting the data. Spatiotemporal 
data are primarily represented by vector data. However, there exists no general 
standard for representing vector data in the Internet. Typical solutions of this problem 
in the field of geographical information systems (GIS) are the following: 

• The usage of raster maps, which are dynamically generated by the server site (see 
e.g. [5]). A restricted functionality of the client and a high data volume are the 
consequences. 

• The second solution is using additional programs that extend the functionality of 
the Internet browser. Depending on the field, where the application is used, 
Active-X components, plug-ins, or applets can be found. Especially, occasional 
users of Internet applications presenting maps are deterred by such solutions [2], 

• Comparable to the second solution is the use of proprietary data formats like 
Macromedia Flash [12], which have cartographic restrictions [14]. Furthermore, 
they are not standardized. 

The goal is a data format for vector data, which is flexible as well as standardized. 
Therefore, an obvious solution is using an XMF-based data representation. A suitable 
candidate is the graphical standard SVG (Scalable Vector Graphics). Version 1.0 of 
SVG has currently the status of a candidate recommendation of the WWW 
consortium [32]. It has many features that allow representing and visualizing 
cartographic information [14]: 

• SVG supports vector data as well as raster data. All essential geometric features are 
supported. 

• SVG allows grouping objects, i.e., the definition of layers. 

• Focal coordinate systems are supported by transformations. Spatial objects can be 
clipped. 

• An extensive and flexible formatting is supported by cascading stylesheets (CSS2). 

• Event handling can be embedded into a graphic by using scripts (JavaScript). 

In order to use a graphical data format (and the corresponding viewers) for visualizing 
moving objects, update features with respect to the already visualized objects are 
required. Inserting new objects as well as removing and updating existing objects 
(especially with respect to their position but also with respect to their shape or other 
properties) are operations, which must be supported by the client (see also section 
3.2). SVG allows such modifications by permitting a complete access to the corre- 
sponding document object model (DOM). This object tree can be modified by em- 
bedded scripts (e.g. written in JavaScript). Figure 2 depicts an example where moving 
objects are visualized by an SVG viewer. 

For standard Internet browsers it can be expected that they will soon support SVG 
without requiring additional plug-ins. However, for devices like PDAs or mobile tele- 
phones, this cannot be expected. Furthermore, the power of such “micro viewers” 
may be too limited for supporting moving objects. Therefore, the architecture must be 
flexible in order to support other data formats for such devices. 
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Fig. 2. Visualization of moving vehicles on the street map of the City of Oldenburg by 
Adobe’s SVG viewer. The moving objects have been created by a data generator [3]. 



2.1.2 The Queries 

In principle, we can distinguish the following types of queries: 

• Standard database queries without any temporal or spatial query conditions. 

• Spatial queries having one (or more) spatial query condition(s). This type of 
queries can be transformed into a set of basic spatial queries like window queries, 
point queries, and spatial joins. Well-known indexing and optimization techniques 
of spatial database systems can be used for performing this type of queries. For an 
overview see [7] . 

• A spatiotemporal query consists of temporal and of spatial query conditions. For 
optimizing such queries, a spatiotemporal database system is required. The result 
set of a spatiotemporal query typically consists of the objects existing at one 
distinct time stamp or at a period described by an interval of time stamps. Further- 
more, the objects fulfill spatial conditions, e.g., they intersect a given query win- 
dow. A spatiotemporal query may consist of additional query conditions without 
any spatial or temporal criteria. Temporal queries without any spatial query con- 
dition may also exist, but they are not typical of a spatiotemporal database system. 

• For depicting moving objects, a variant of a spatiotemporal query is required: its 
temporal query condition consists of a time interval between the current time and 
the unlimited future. In this case, all current objects fulfilling the other query con- 
ditions are transmitted from the database system to the client. Then, a continuous 
update of the client is required: new, relocated or removed objects are detected by 
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the database system and are - as soon as advisable or necessary - transmitted to the 

client. The treatment of such continuous queries is of special interest to this paper. 

The formulation of spatiotemporal queries requires a foundation for representing the 
objects, their attributes and the query conditions. A fundamental work for the 
representation of spatiotemporal objects is the paper of Gtiting et al. [8] where 
spatiotemporal data types and operations are presented and embedded into a query 
language. An adaptation for a discrete data model can be found in [6], For using such 
results in an XML-based environment, a corresponding XML notation is required. In 
figure 1, this is indicated by the notation „spatiotemporal XQL“. 

2.1.3 The Communication with the Internet Server 

The processing of the queries requires a communication with the Internet server. 
Typically, the transmission of data from the server to the client is initiated by the 
client. However, the processing of continuous queries requires server-initiated data 
transmissions. This is an untypical operation and cannot be found in comparable 
server architectures. A client does not know when and how many objects are changed 
in the spatiotemporal database. On the abstract level that means that the roles of the 
server and of the client have changed. 

For updating clients typically server-push technology is used [13]. Such a 
pushing is based on a stable TCP connection and allows an update by transmitting 
complete HTML pages. One disadvantage of this technique is the high number of 
active connections to the server. For applications depicting spatiotemporal objects, 
transmitting complete pages is unsuitable because generally only a small part of the 
depicted objects has been modified between two updates. For example, in a map 
where moving vehicles or traffic jams are depicted, only some of these objects may 
change but not the rest or the background like the street map. A complete 
transmission would be a waste of time; especially for mobile applications, this 
solution cannot be applied. 

A solution of this problem is to start a server thread at the client site. This can be 
done by a small applet, which registers the client at the Internet server. This mechan- 
ism allows an identification of the client after an HTTP request is finished. This iden- 
tification, e.g. the IP address of the client and an assigned port number, are stored in 
an administrative database. In figure 1 , the server thread at the client site is indicated 
by the “communicator” within the box of the Internet browser. 



2.2 The Internet Server 

The Internet server is a standard HTTP server handling the page requests of the 
clients. Additionally, it passes the queries of the clients to the map server and trans- 
forms the query results into the client-specific data representations considering par- 
ticular style formats. 
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2.2.1 Processing of the Query Results 

As mentioned before, different types of clients may require different data formats. We 
assume that these representations are described by XML. On the other hand, the other 
components of the architecture should not handle several data formats in order to 
minimize their complexity. Instead, a uniform representation (also described by 
XML) is reasonable to use. For the representation of spatial objects, the OpenGIS 
consortium (OGC) proposed an XML-based specification, the Geography Markup 
Language GML [15]. In contrast to SVG, which is a format for displaying vector data, 
GML allows characterizing the properties of geographic features. The GML definition 
must be extended for representing properties, which are specific for moving spatial 
objects. This extension is called temporal GML in the following; it is the data format 
used for the data provided by the map server. 

For converting temporal GML into another XML representation, the usage of XSL 
transformations is suitable. The transformation part of XSL (Extensible Style 
Language) (XSLT) [31 ] converts an XML document into another XML document by 
using XSL stylesheets. For supporting different data formats, it is only necessary that 
the Internet server provides suitable XSL stylesheets. An example for an XSL 
stylesheet converting GML into SVG is depicted in figure 3. The integration of the 
XSL transformers into the Internet server may be done by Java servlets, Java Server 
Pages (JSP), or by Active Server Pages (ASP). 

In principle, it is possible to perform the XSL transformation on the client site, e.g., 
by the Internet browser. (For example, Microsoft’s Internet Explorer shows quite 
reasonable results after updating Windows on the MSXML version 3.0.) However, 
for devices of low performance as well as for devices receiving data by (slow) 
wireless connections, a transformation at the server side is preferable. Additionally, 
by using different CSS stylesheets, a client-specific or even a user-specific formatting 
of the data can be achieved. 

In comparison to binary data, XML documents considerably increase the required 
data volume. However, this problem is mostly compensated by high compression 
rates. Preliminary results show that a document describing a large set of spatio- 
temporal objects will be compressed by a factor of about 20 if a GML representation 
and ZIP compression are used. For SVG, a factor of about 7.5, and for a binary 
format, a factor of about 2.5 have been achieved. 



2.3 The Map Server 

The main task of a map server is to coordinate the co-operation of the database sys- 
tem(s) and the Internet server. In commercial solutions for managing spatial data, 
(one of) the database system(s) is often replaced by a GIS. The map server passes the 
queries to the relevant database system(s) and processes the query results. A spatio- 
temporal database system manages the data according to spatial and to temporal as- 
pects. Therefore, it cannot be assumed that the data in the database is (hierarchically) 
represented by XML or that the database system can be queried by an XML-based 
query language. The map server transforms the queries into the database-specific 
query language and converts the results into temporal GML. 
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GML fragment describing a segment of a street (according to GML version 1 .0) 



<Feature typeName= "StreetEdge3 " > 

<name>734407488</name> 

<property typeName= " streetname" type= " string" >Heideweg</property> 
<geometricProperty typeName= " location" > 

<LineStringxcoordinates>8961 , 8839 8894 , 8 710< /coordinates >< /Lines tring> 
</geometricProperty> 

</Feature> 



Corresponding SVG fragment 



<polyline class="c3" style="stroke-width: 25 ; " points= " 8961 , 8839 8894,8710"> 
</polyline> 



Part of the XSL stylesheet that transforms the GML fragment into a corresponding SVG fragment 



<xsl : template match= " f eatureMember [starts -with (@typeName, 1 StreetEdge ' ) ] " > 

<xsl : element name= "polyline "> 

<xsl : attribute name=" class" > 

c<xsl : value-of select= " substring-after (@typeName , 'StreetEdge') "/> 

</xsl : attribute> 

<xsl : attribute name= " style" > 

stroke-width: <xsl : value-of select="$revscale"/ >; 

</xsl : attribute> 

<xsl : attribute name= "points "> 

<xsl : value-of select= " Feature/geometricProperty/LineString/ coordinates " / > 
</xsl : attribute> 

</xsl : element> 

</xsl : template > 



Fig. 3. Example for transforming GML into SVG by using XSL transformations. 



2.4 The Spatiotemporal Database System 

The final component of the architecture depicted in figure 1 is the spatiotemporal 
database system. It stores the non-moving spatial objects as well as the moving 
objects. Our architecture supports especially querying spatial and spatiotemporal data; 
these queries are processed by the spatiotemporal database system. 

The spatiotemporal database system serves one or more clients requesting data 
from the database. Changes of the database are performed by processes running 
outside of the described architecture (indicated by the symbol “World” in figure 1). 
Database triggers are used for initiating the update of the clients. 

According to Sellis [21], the main topics of interest in the area of spatiotemporal 
databases are related to the structure of time and space, to data models and query 
languages, to the efficient processing of queries by using adequate storage structures 
and databases architectures, and to the design of graphical user interfaces for spatio- 
temporal data. An important aspect of the implementation of spatiotemporal database 
systems has been the development of suitable storage structures and indexing tech- 
niques for optimizing the processing of spatiotemporal queries; examples for such 
publications are [29], [30], [11], and [17], 

In the rest of this paper, we will concentrate the discussion on the continuous 
query. 
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3 Continuous Queries 

After a short definition of the model used in this paper for describing spatiotemporal 
objects, a classification of the update operations and the definition of the result set of 
a continuous query are presented in this section. 



3.1 The Model of Spatiotemporal Objects 

The following discussion assumes an architecture consisting of a spatiotemporal 
database system, which stores the positions and other attributes of moving objects. In 
a temporal database, valid time and transaction time are distinguished. The valid time 
describes the time when the data were valid or are valid in the modeled reality. The 
transaction time is the time, when the data were committed in the database system. 

In the following, we assume that a moving object obj having an object identifier 
obj.id is described by a sequence of records obj i consisting of a spatial location 
obj Joe (short: loc .), a time stamp objJime (short: time) giving the beginning of the 
valid time, a time stamp objJrTime (short: trTime) giving the transaction time, and 
non-spatiotemporal attributes obj r attr (short: attr) ( i N). A time stamp t is 
represented by a natural number (t N). If the records obj i und obj M exist, the valid 
time of record obj i corresponds to the time interval [time , , time i+ j). If no record obj M 
exists for a record obj., the record obj. is the current state of the corresponding moving 
object from the point of view of the database system. Furthermore, a final record obj t 
may exist with i > j for all records obj. of this spatiotemporal object. A final record 
obj i indicates the end of the lifetime of the object, e.g., by an empty location. In order 
to simplify the discussion, we assume that time i trTime . holds. 

This form of representing spatiotemporal objects is only one option; it can be re- 
placed by another model because of conceptual reasons [8] or because of implemen- 
tation purposes [6], 

3.1.1 Describing Spatiotemporal Objects in GML 

The building blocks of GML are so-called simple features, which “describe the 
geography of entities of the real world” [15]. A feature in GML (version 1.0) may 
consist of a name, a description, a bounding box, several (alphanumeric) properties, 
and several geometric properties. For realizing the above data model in GML, we 
have to represent those attributes by related elements. The object identifier can be 
mapped to the element name and the locations can be described by the geometric 
properties of GML. The attributes time and trTime are of special interest. One 
possibility is to represent them by (alphanumeric) properties. A property element is 
defined as follows in GML: 



<! ELEMENT property (#PCDATA) > 

< ! ATTLIST property 

typeName CDATA #REQUIRED 

type ( boolean | integer | real | string ) "string" > 
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That means we can define the types of properties without changing or extending the 
DTD (document type definition) of GML (version 1.0). 



3.2 Types of Update Operations 

At one time stamp, one or more objects may be updated. We can distinguish three 
types of updates concerning a spatiotemporal object obj : 

1 . The insertion of a new object: 

In this case, no record having the same object identifier obj. id existed before. 

2. The modification of an existing object: 

In the general case, any attribute of the object may be changed, i.e. at least one 
record having the object identifier obj. id and an older time stamp is already stored 
in the database. In this paper, we are especially interested in relocated objects. 
Using the model described in section 3.1, a modification requires the insertion of a 
new record into the database. 

3. The deletion of an existing object: 

In this case, at least one record having the object identifier obj. id with an older 
time stamp is already stored in the database. The end of the lifetime of the object is 
indicated by inserting a corresponding record into the database. 

In principle, each of these update operations must notify all concerned clients in order 
to update them. 



3.3 Classification of Update Operations 

A simple solution for performing a continuous query is to inform each client about 
every update. However, this is a very inadequate and expensive solution. Especially 
for the case where the client is an Internet client, such a solution should not be 
considered. 

Determining the interest of a client in an update operation requires the evaluation 
of the object representation at two consecutive time stamps. This leads to the 
following classification of updates. An overview of this classification is given in 
figure 4. 

Class 1: Updates of High Interest 

This class of updates represents operations, which are of high interest to a client, 
because they represent essential changes. The class consists of the following update 
operations: 

• (II) Insertion of a new object fulfilling the query condition of the client. 

• (D 1 ) Deletion of an object, which has fulfilled the query condition of the client at 
the previous time stamp. 

• (12) Modification of an existing object which fulfills the query condition of the 
client now but which has not fulfilled the query condition at the previous time 
stamp. 
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• (D2) Modification of an existing object which does not fulfill the query condition 
of the client now but which has fulfilled the query condition at the previous time 
stamp. 

With respect to the client, the two operations (II) and (12) are insert operations and 
the two other operations (Dl) and (D2) are delete operations. 

Class 2: Updates of No Interest 

This class consists of the updates, which are of no interest for a client: 

• Insertion of a new object not fulfilling the query condition of the client. 

• Deletion of an object, which has not fulfilled the query condition of the client at the 
previous time stamp. 

• Modification of an existing object which neither fulfills the query condition of the 
client now nor at the previous time stamp. 

A client should not be informed about updates of class 2. Therefore, a component 
must exist, which eliminates these operations with respect to distinct clients. 

Class 3: Updates of Medium Interest 

There exists one operation not belonging to the two classes described above: 

• (Ml) Modification of an existing object, which fulfills the query at the previous 
time stamp as well as now. 

With respect to a client, operations of this class are (the only) operations modifying an 
existing object. In general, the number of these updates considerably exceeds the 
number of operations of high interest. A client is of course interested to be informed 
about these modifications. However, if in the sequence of modifications of low rele- 
vance some are skipped, this will be acceptable for a client. Therefore, the operations 
(Ml) are called updates of medium interest in the following. 
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Fig. 4. Types of update operations. 



3.4 Definition of the Result of a Continuous Query 

Using the model of spatiotemporal objects of section 3.1 and the classification 
described above, we can define the result of a continuous query q with a query 
condition c for a client. In the following, ts denotes the database time when the query 
is made, ts+n the time when the execution of the query is stopped, and res k is the 
result of the query at database time ts +k: 
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(1) res 0 = { (obj„ 1 1 ) | obj, fulfills c rime, ts ( timej with j i timej < time,)} 
res k = { (objj, II ) | obj; fulfills c trTimej = ts+k ( i = 0) } 

{ ( objj , 12) | obji fulfills c (objj.j fulfills c) trTimej = ts+k } 

{ (objj, Dl) | objj indicates end of lifetime objj.j fulfills c trTimej = ts+k ) 

{ (obj h D2) | ( objj fulfills c) objj.j fulfills c trTimej = ts+k ) 

{ (obj,. Ml ) | obj, fulfills c objj.j fulfills c trTimej = ts+k } fork {1 

The set res 0 corresponds to the initial result of the continuous query. The results res k 
(k>0) contain the incremental updates and should be transmitted to a client at database 
time ts+k. However, this is an unrealistic assumption. Therefore, this definition is 
only a set target that cannot be achieved. In section 5, we will use this definition for 
defining a measure of quality. Suitable parameters for restricting the results of a 
continuous query are presented in the next section. 



4 Filtering the Results of a Continuous Query 

Technical restrictions limit the computing power of the clients, the maximum 
resolution of spatial distances the client is able to distinguish, and the maximum speed 
and throughput of the connection between the database system and the client. 
Therefore, it not advisable to transmit each result set res k of a continuous query. 
Instead, a reasonable filtering must be performed. 

In order to regulate this filtering, parameters are required for controlling the 
process dependent of the (type of) client. In this section, parameters restricting the 
time and the space dimension are presented. Then, a measure of interest is discussed, 
which allows indicating the interest of a client in an update operation. Finally, a 
component for filtering the query results is integrated into our architecture. 



4.1 Restricting Parameters 

The general demand of a client is to be informed about any interesting update as soon 
as possible. However, as mentioned before technical restrictions may prevent to fulfill 
such a request. For continuous queries, restrictions with respect to the time and to the 
spatial dimension are of special interest. 



4.1.1 Time-Oriented Parameters 

A client is generally connected to the database system by a network. Using the 
Internet or a wireless communication, the data volume, which can be transmitted, is 
restricted. A frequent update of a client using a slow connection and / or a connection 
of restricted capacity is unnecessary. Quite the reverse, too fast or too voluminous 
data transmissions lead to data jams preventing a contemporary update of the client. 
Another aspect is the computing power of the client. Thus, the number of updates to 
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be sent is restricted. On the other hand, a minimum data volume may exist which 
should be sent in order to use the capacity of the data connection and/or of the client. 
The following three parameters describe these restrictions: 

• In order to simplify the considerations it is assumed that all operations require the 
same transmission volume and the same computing power by the client. Then, the 
parameter maxOps describes the number of update operations, which can be sent to 
a client per a single data transmission at maximum. Another parameter minOps 
denotes the minimum number of operations reasonable be sent. It holds: ininOps 
maxOps. 

• Furthermore, in order to support, e.g., wireless data connections, we assume that 
there exists a minimum time period minPeriod between two transmissions to a 
client. If update operations were sent to the client at database time t , the next 
operations should not be sent before t+minPeriod. 

4.1.2 Space-Oriented Parameters 

Applications depicting spatial data have in general a maximum resolution, i.e. there 
exists (in the two-dimensional case) a minimum distance which is distinguishable or 
of relevance. If the movement of an object is smaller than the minimum x-distance 
minDist x and the minimum y-distance minDist , it is unnecessary to inform the client 
about this movement. Using these two distances, we can define the function 
isRelevant as follows: 

(2) isRelevant (obj prev ,obj curr ) := x-distance (loc curr ,loc prev ) minDist x 

y-distance (loc curr ,loc prev ) minDist y 

All the parameters described above differ for distinct (types of) clients. 



4.2 Measure of Interest 

In the former section, we defined parameters in order to filter the operations to be sent 
to the client. Consequently, we must select a subset of operations to be transmitted 
from the set of all update operations of interest. For performing this filtering, we need 
a selection criterion. In section 3.3, we have already classified the operations in two 
relevant subsets: class 1 consisting of operations of high interest and class 3 of 
operations of medium interest. 

For class 1, we can formulate the following requirement: A client should be 
informed about the update operations of class 1 as soon as possible. If not all update 
operations of class 1 can be sent to a client, the transmission should start with the 
operations having the oldest time stamps (with respect to the valid time). The other 
operations of this class must be buffered by the database system and are transmitted 
(if possible) by the next transmission(s). 

Typically, the number of updates of class 3 considerably exceeds the number of 
operations of high interest. Therefore, it is necessary to consider the update operations 
of class 3 in more detail. The interest of a client in such an update depends on 
different factors. 

In order to describe the interest of a client in an update operation, we need a 
measure of interest. The interest depends especially on the last description the client 
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has received. This object description is called obj lasl in the following. The current 
description of the object is called obj curr . The measure of interest intr( obj last , obj ca J can 
use the attributes time, loc, and cittr representing the valid time, the location, and the 
non-spatiotemporal attributes of the object description, respectively. The measure of 
interest may change in an arbitrary manner between two succeeding update 
operations. 

For example, a measure of interest may be computed as follows (dist denotes the 
Euclidean distance): 

(3) intr(obji as „ obj curr ) := (time curr - time last ) + w loc * dist (loc curr , Ioc, ast ) 

A suitable factor w loc is required for equalizing the influence of time and space, 

4.3 Integration of a Component for Filtering Objects 

Computing the measure of interest requires an efficient computation of the last object 
descriptions the client has received for deciding, which update operations of class 3 
are of relevance to a client. If the relevance exceeds a given threshold, the object 
representation will be sent. If the number of records exceeds the maximum allowed 
number, some operations have to be buffered.. 

The component responsible for determining the objects to be transmitted and for 
buffering the candidates, which may be sent at the next transmission(s), is th e filtering 
component of our architecture. In figure 1, it is depicted as a part of the 
spatiotemporal database system. 



5 Measure of Quality 

A measure of quality can be used for two purposes: 

1. For determining the quality of a filtering algorithm. Then, an assessment is 
possible which of two or more algorithms shows the best behavior in the same 
situation. 

2. For controlling the behavior of an algorithm running in the filter component. If 
the quality becomes too low, for example, the parameters presented in section 4. 1 
may be changed. 



5.1 Definitions of Restricted Result Sets 

For the definition of a measure of quality, two further result sets of a continuous 
query are defined. A sequence res will be called a complete result of a continuous 
query q if the following conditions hold: 

(4) res’o res 0 

res' k {(obji, II) RES’(k) \ obj, fulfills c ts trTime , ts+k (i = 0) } res 0 
{ (obji, 12) RES’(k) | obji fulfills c (obj^i fulfills c) ts trTime ,■ ts+k } 
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{ (objj, Dl) RES’(k) | objj indicates eol objj.j fulfills c ts trTimej ts+k J 

{(objj, D2) RES’(k)\ (oZy, fulfills c) oZy',_; fulfills c ts trTimej ts+k } 

{(obji. Ml) RES'(k) | obj, fulfills c objj.j fulfills c ts trTimej ts+k} 
for k { 1, with n’ n 

n n' 

with u res m = u res' m and 

m=0 m-0 

(obj;, op;) res’ k (objj, opj) RES’(n’+ 1) (objj, opj) RES’(k) for / < i 

k 1 

where RES'(k)is defined by RES'(k) := 

m-0 

Again, the set res’ 0 corresponds to the initial result of a continuous query and the 
results res k (k> 0) contain the incremental updates, which are transmitted to the client 
at database time ts+k. The union of all subsets res k of definition (1) is identical to the 
union of all subsets res\ of definition (4). Furthermore, the order of operations 
(obj^ op) is not altered for a spatiotemporal object obj. 

A sequence res ”, will be called a consistent partial result of a continuous query q if 
the following conditions hold (res\ is defined in equation (4)): 

(5) res" k ’ res’ k fork {0 n’ } 

with (objj , opd res" k (objj , opd res’ k 

(opj {11,12} (objj, Ml) res’ k (objj , opd res'j withyci) 

(oppMl (objj, op’j res\ (objj,o pj) res’j with j<i, opj {11,12}, 

opj {D1,D2} ) 

and (objj, opd res” k (objj, opj) RES”(n’+ 1) (objj, opj) RES”(k) for j < i 

and (objj, op; {D1,D2,M1}) res’j (objj , opj {11,12}) RES”(k ) with j<i 

and (obji , opt {11,12}) RES”(n’+ 1) 

(objj , opj {D1.D2}) RES”(n’+\) objj fulfills c at ts+k with / > i 

k 1 

where RES”(k) is defined by RES"(k) := \^]res" m 

m-0 

In the result set of (5) some operations may miss - in the extreme case all subsets are 
empty. However, it is guaranteed that for each modification or deletion a previous 
insert operation exists and that for each object existing in the result set also a delete 
operation is included if the object has been deleted in the database before ts+n ’. The 
operator ’ is used instead of for supporting the situations that an insert operation 
and a succeeding modification are combined to one insert operation or that a pair of 
succeeding delete and insert operations is combined to one modification. These cases 
are covered by the first implication of (5). 
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5.2 Measure of Quality 



As mentioned in Section 3.4, the result of a continuous query can be used for defining 
a measure of quality. Comparing the real result with that result set, some update 
operations may be sent to the client after some delay. In this case, we can define the 
average delay as follows: 



(6) avDelay := k ' kwith(obj i ,op) 

k '= 0 ( obj iy op ) res' k , 



res ' k 



/ 



U res 'r 



| t '=0 



In the consistent partial result, some operations (obj r opj) may miss. The larger the 
temporal and the spatial distance of (obj r op s ) to its “neighbored” operations 
(obj iV opj.j) and (obj i+I , op i+1 ), the worse is omitting such an operation. Therefore, the 
temporal and the spatial distances are squared in the following definition of the 
average omitting degree (omitDeg), which is the sum of a time component (timeOD) 
and a distance component (distOD). 



(V) 



" ( time f time i t ) 2 + ( time i time i+1 ) 2 

timeOD := 

k= 0 {obj,.op) res k with(obj, ,op) RES" 4 timeSum 



n 

distOD := 

k=0 



distiloc^loCj j) 2 + dist(loc i ,loc j+1 ) 2 



(objj,op) res k with(objj ,op) RES" 



4 distSum 



omitDeg := timeOD + distOD 



with 



n 

timeSum := (lime, time i+1 y 

k = 0 ( objj,op ) res k with(objj ,op) RES 



distSum := (dist(loc i , loc i+1 )) 2 

k = 0 ( objj,op ) res k with ( obj t pp ) RES 
n n' 

RES := [J res k and RES":= [J res" k 

k=0 k = 0 

dist denotes the Euclidean distance between two locations. The divisors timeSum and 
distSum are used for normalizing the distances. If no neighbored operation exists, the dis- 
tance to it will be defined as 0. If an object is completely missing in RES' ’, all corre- 
sponding operations are aggregated in omitDeg. The complete result of a continuous 
query has the omitting degree 0. An empty result set would have an omitting degree of 1 . 



6 Related Work 

Terry et al. [28] presented the notion of “continuous queries” in the field of database 
systems. They proposed an incremental evaluation approach in order to avoid repeti- 
tive computations. In the NiagaraCQ [4], also performance issues of the database 
system have been the target. In order to support millions of queries, similar query 
conditions are grouped, which allows sharing common computations and reducing the 
I/O cost. 
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The presented work is also related to the field of spatiotemporal database systems. 
In general, they have been discussed already in Section 2.4. This section refers to 
more specific papers with respect to continuous queries. 

Wolfson et al. [33] address in their paper the problem to determine when the 
location of a moving object in database should be updated: a dead-reckoning update 
policy dictates that there is a deviation for which an object (e.g. a car with a GPS re- 
ceiver) should send a location or speed update to the database. In some sense, this 
problem is complementary to continuous queries. In [27], three types of queries (in- 
stantaneous queries, continuous queries and persistent queries) are distinguished for 
the case of having dynamic spatial attributes, i.e. the attribute changes continuously as 
a function of time, without being explicitly updated. 

Because the transmission of updates is triggered by the database system, the area of 
active database systems must be considered. Paton and Diaz give a good overview of 
this topic in [16]; they close their article with the remark that further research is 
required (among others) on the integration of active behavior with temporal facilities. 
With respect to this topic, the paper of Sistla and Wolfson [26] presents two lan- 
guages for specifying temporal triggers; one of them handles temporal logic of future 
events. However, special mechanisms for filtering events are not discussed in this 
publication. The focus of Ramamritham et al. in their paper on integrating temporal, 
real-time, and active databases [18] is on recovery and transaction issues. 

Besides, the work of Rosenblum and Wolf [20] should be mentioned. In their 
framework for Internet-scale event observation and notification, they demand a filter 
policy within their observation model, however, without giving any details. 

For designing and implementing algorithms used by the filtering component, 
attention should be given to the work on incremental maintenance of materialized 
views where similar problems occur, see e.g. [19] and [22], 



7 Conclusions 

In this paper, an important query in the field of spatiotemporal database systems - the 
continuous query - has been investigated. The work was motivated by applications 
tracking and presenting mobile objects in an Internet environment where different 
types of clients including mobile devices are used. For such applications querying 
moving spatial objects, an architecture has been proposed, which supports an XML- 
based data representation. This architecture has been the base for discussing con- 
tinuous queries. 

After a classification of the update operations in a spatiotemporal database system, 
a formal definition of the result set of continuous queries has been presented. In order 
to reduce the volume and frequency of transmissions to the clients, parameters have 
been defined in order to model technical restrictions as well as the interest of a client 
in a distinct update operation. A component for performing such a filtering has been 
integrated into our architecture. However, delaying and not transmitting update 
operations to a client mean to decrease the quality of the query result. Therefore, two 
quality measures have been presented. 

The next step will be the development on efficient algorithms for performing the 
filtering. A special attention will be given to maintain the quality of the query results. 
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A further requirement to such algorithms concerns their scalability for supporting 
large number of clients. 

The definition of continuous queries and the XML-based representation of 
spatiotemporal objects were based on a quite simple model of spatiotemporal objects. 
Therefore, future work should cover a definition using a more expressive data model. 
Especially, the support of motion described by a motion vector instead of a constant 
object position must be investigated. 

Another aspect is the behavior of the restricting parameters. The resolution of a 
client may be changed by performing a zoom operation and the parameters minOps, 
maxOps and minPeriod may be affected by the traffic of other users of the network 
connection or by a changed capacity of the connection. For example, using the new 
mobile telephone standard UMTS, the maximum speed of a connection will depend 
on the distance of the mobile telephone to the next base station. Therefore, filter 
algorithms are required, which observe varying parameters. 
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Abstract. Several studies have focused on the efficient processing of simple 
spatial query types such as selections and spatial joins. Little work, however, 
has been done towards the optimization of queries that process several spatial 
inputs and combine them through join and selection conditions. This paper 
identifies the dependencies between spatial operators and illustrates how they 
can affect the outcome of complex queries. A thorough analysis yields selectiv- 
ity estimations that can be used to optimize any combination of spatial and non- 
spatial selection and join operators. The accuracy of the formulae is evaluated 
through experimentation with various queries. In addition to their importance 
for spatial databases, the presented results can be applied in several other do- 
mains, where dependencies exist between operators. 



1 Introduction 

Relational queries are processed by combining operators (e.g., sort-merge, hash-join, 
index-based search) in an optimal way [10]. Optimization algorithms (e.g., dynamic 
programming) search through the space of valid plans in order to identify one with 
low cost. Most query optimizers estimate the output sizes using catalog information 
(e.g., size and distribution) about the relations involved in the queries. The core as- 
sumption is that there are no dependencies between different attributes. Therefore, the 
results of any complex query are assumed to depend solely on the distribution of the 
data from the base tables. In many cases the independence assumption holds and the 
optimizer returns accurate results. For instance, assume that two tables Employee and 
Department are joined through the common attribute Deptld, and the following selec- 
tions apply: Employee.Age>35 and Department. Sales< 10000. The Deptld values 
after the application of selections are expected to maintain their initial distribution, 
since there is no explicit dependency between the query attributes. Thus the cost of 
the join can be estimated using the selectivity of the selections and statistical informa- 
tion about the base tables. 

In spatial database systems [9], a complex query may contain several spatial and 
non-spatial components. For instance, the query “find all cities within 400km of Mu- 
nich, which are less than 10km away from a forest and are crossed by a river wider 
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than 20m” includes two spatial joins (City M Forest, City M River), a non-spatial 
selection (River.Width > 20m) and a spatial selection (City.CRegion within 400km of 
Munich). Although there may be no dependency between the non-spatial attribute 
River.Width with the spatial (i.e., location/extent), notice that there is always a de- 
pendency between the spatial operators involved in a query. We do not expect that 
cities located in America or Asia will be part of the result. Therefore the spatial selec- 
tions affect not only the number of objects that will participate in a succeeding join, 
but also their spatial distribution, which determines the query result. 

The next example strengthens our point. Consider the plan of Figure la, where If, 
R, are spatial relations, wI , w2 spatial selections (e.g., window queries) and If a third, 
not necessarily spatial, relation. Figures lb and lc illustrate example results of the 
selections applied to the corresponding dataset. Clearly the cost of the last join de- 
pends on the selectivity of the first (spatial) one. The output of the spatial join, how- 
ever, does not depend solely on the results of the window queries, but also on their 
relative position. As Figure Id shows, the output size of the join increases with the 
intersection area of the windows since only objects inside or near the intersection may 
participate in the result. If the windows are far apart, the result of the spatial join is 
expected to be empty. 




Fig. 1 . Example of spatial operator dependency 



Although interdependencies between selection and join attributes are not common in 
non-spatial queries (i.e., relations are typically joined on a key attribute, whereas 
selections apply on non-keys), they always exist in spatial queries because there is 
typically only one spatial attribute per relation. This paper studies selectivity estima- 
tion of complex spatial queries that involve spatial selections and joins. More specifi- 
cally, given a query consisting of n relations joined on their spatial attribute and po- 
tentially restricted by selection windows, we provide accurate formulae that estimate 
its output size. Following the common conventions in spatial query optimization, we 
assume that the input data are uniformly distributed and that the predicate is intersect 
(overlap). The proposed formulae use catalog information about the mean sizes of the 
objects in spatial relations to estimate the query result. 

Our techniques can be applied for arbitrarily distributed datasets using local statis- 
tics like 2D-histograms. They are also appropriate for spatial queries with predicates 
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other than intersect (e.g. meets, covers ), since such queries can be transformed to 
intersection queries (see [20] for a case study). Even more importantly, the methods 
are not strictly limited to the spatial domain, because dependencies between operators 
can also be found in other database applications. The rest of the paper is organized as 
follows. Section 2 provides background on processing and optimization of simple 
spatial query types and multiway spatial joins. Section 3 presents formulae that esti- 
mate the selectivity of complex spatial queries. In Section 4 we evaluate the accuracy 
of the proposed models for complex spatial queries on uniform data. Section 5 dis- 
cusses extensions to real data and Section 6 concludes the paper. 



2 Background 

Spatial database systems [9] organize and manage large collections of multidimen- 
sional data. Spatial relations, apart from conventional attributes, contain one attribute 
that captures the geometric features of the stored objects. For example, the last attrib- 
ute in relation City (CName, PostcilCode, Population, CRegion) is of spatial type poly- 
gon. In addition to traditional data structures (e.g., B-trees) for alphanumeric 
attributes, spatial relations are indexed by multidimensional access methods [7], usu- 
ally R-trees [8], for the efficient processing of queries such as spatial selections (or 
window queries) and spatial joins. Selections (e.g., “cities in Germany”) apply on a 
single relation, while spatial joins (e.g. “cities crossed by a river”) combine two rela- 
tions with respect to a spatial predicate (typically intersect, which is the counterpart of 
the relational equi-join). 

Complex spatial queries include spatial and non-spatial selections and joins. They 
can be processed by combining simple operators in a processing tree (plan) like the 
one illustrated in Figure la. The efficiency of an operator depends on whether its 
input (inputs) is (are) indexed. For instance, the cost of a window query applied on an 
R-tree is typically linearly related to the size of the window. On the other hand, if the 
selection applies on intermediate results, the whole input needs to be scanned inde- 
pendently of the selectivity of the operator. The same applies for join operators. The 
most efficient spatial join method is the R-tree join (RJ) [5], which matches two R- 
trees. Some methods [15, 18] join an R-tree with some non-indexed dataset (e.g., an 
intermediate result of another operator). Others [16, 19, 14, 2] organize two non- 
indexed inputs in intermediate file structures (e.g., hash buckets) in order to join them 
efficiently in memory. 

Typically, a complex query has a large number of potential execution plans with 
significant cost differences. For instance, an alternative plan to that of Figure la 
would be to first join R 2 with R„ then apply the selection on R, and finally join the 
intermediate result with R r after it has been restricted by w r The number of plans 
increases exponentially with the number of involved relations (see [18, 25] for an 
analysis on spatial and non-spatial domains). Optimization algorithms search either in 
a deterministic (e.g., dynamic programming) or a randomized way (e.g., hill climbing 
[12]) to find a cheap plan. The cost of a specific plan is computed using formulae for 
(i) the operators involved in the plan, and (ii) the output size of each sub-query of the 
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plan. The first provide an estimate for the cost of each node in the plan, while the 
second determine the cost of succeeding operators. In the query of Figure la, for 
instance, the selectivity of the spatial join R-f<\R 2 affects the cost of the final operator. 

There has been extensive research on the accurate estimation of the selectivity and 
cost of spatial operators. The selectivity of spatial selections has been studied as a 
prerequisite for the I/O cost estimation of R-tree window queries [13, 24, 23, 27], 
Given a spatial dataset R of N ^-dimensional uniformly distributed rectangles in a 
rectangular area r (workspace), the number of rectangles that intersect a window 
query w ( output cardinality - OC) is estimated by the following formula: 

k 

OC(R,w) = N min 1 , Sd - Wd (1) 

<*=i r a 

where s d is the average length of the projection of a rectangle s R at dimension d. 

w d and r d are the corresponding projections of w, r respectively. The last factor 
(product) of Equation 1, called Minkowski sum, is the selectivity of the window query 
(i.e., the probability that a random rectangle from R intersects w). This probability at 

some dimension d equals the sum of projections s d and w d on that dimension nor- 
malized to the workspace. Equation 1 can be extended for the output cardinality of an 
(intersection) spatial join between two relations R 2 and R 2 as follows [28]: 

OC(R l ,R 2 ) = N l N 2 k min 1, ^ (2) 

i = 1 r d 

N r N 2 denote the cardinalities of the datasets, and s Y d , s 2 d correspond to the average 

length of the projection of rectangles .y ( R 2 and s 2 R 2 on dimension d. In other 
words, the expected number of rectangle pairs that intersect is equal to the number of 
results after applying N 2 window queries of area s 2 on R r 

The selectivity of multiway spatial joins can be accurately estimated only for acy- 
clic and clique (i.e., complete) query graphs. Let R p ..., R n be n spatial datasets joined 
through a query graph Q, where Q. = True, iff rectangle r It should intersect rec- 
tangle r. R . When Q is acyclic, the number of qualifying object combinations can 
be estimated by: 

OC{R l ,...,R n ,Q) = " N , * min l / u (3) 

i = 1 i,j:Qij=TRUE d= 1 r d 

The above formula actually restricts the Cartesian product of the datasets using the 
selectivities of the query edges, which are independent. When the query graph con- 
tains cycles, the pairwise selectivities are no longer independent. For instance, if a 
intersects b and b intersects c, the probability that a intersects c is relatively high 
because the rectangles are expected to be close to each other. Thus Equation 3 is not 
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appropriate for such cases. For the special case where Q is complete (clique) a closed 
formula that estimates the output of the multiway join is proposed in [21]: 

n k 1 n n 

OC(R l ,...,R ll ,Q) = N, = s Ud (4) 

;=i d = i ( n 1) r d i=i j=i,j i 

This formula can be derived by the observation that if [,v,, s 2 , . .., s nl } is a clique then 
{.v r .y 2 , s nI , s n } is also a clique iff s K intersects the common area of all rectangles in 
{s p s 2 , . .., s nl }. Equations 1 through 4 are accurate for uniform datasets covering the 
same workspace r. In real life applications these assumptions may not hold, therefore 
researchers have extended some of them for skewed datasets. A histogram-based 
method that estimates the selectivity of window queries is presented in [3]. This 
method decomposes the space irregularly using buckets that cover objects of similar 
density and keeps statistical information for each bucket, considering that its contents 
are uniform. A similar technique that divides the objects using a quad-tree like parti- 
tioning is presented in [1]. Another method that applies only on point datasets and 
uses theoretical laws is proposed in [4] . Regarding the selectivity of spatial joins, 
relatively little work has been done. In [28, 18] the space is decomposed using a regu- 
lar grid and uniformity is assumed for each cell. The output of the join is then esti- 
mated by summing up the estimations from each cell. In [6] an interesting law that 
governs the selectivity of distance spatial joins (i.e., is joins that return point pairs 
within a distance parameter) is presented. 

As discussed in the introduction, the relative positions of selection windows de- 
termine the skew of the joined rectangles from each dataset. Thus existing formula 
that focus exclusively on spatial selections or joins are not applicable. In the next 
section we study the selectivity of complex spatial queries, where the dependency of 
operators affects the query result. 



3 Selectivity of Complex Spatial Queries 

When only one selection window w i that applies on a single dataset R. exists, selectiv- 
ity estimation is rather simple. Since the result is the same independently of the order 
according to which operators are applied, we can assume that the selection follows 
the join. The output cardinality can then be estimated by multiplying the correspond- 
ing join formulae with the selectivity of the window query: 

* \ -L- it; 

0C(/om, vv ; ) = OC(Join) min 1, d — d (5) 

r d 



where OC(Join) can be any of Equations 2, 3 or 4. 

The problem is more complicated when two or more spatial selections exist on the 
joined datasets. A simple approach is to assume that the join workspace is the com- 
mon area of all selections, and apply a single selection on the multiway join result 
using this area. This, however, would be inaccurate since the common area of selec- 
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tion windows may be empty, while there may exist objects that qualify the query, 
especially if the non-intersecting windows are close to each other and the data rectan- 
gles are large. 



3.1 Selectivity of a Pairwise Join Restricted by Two Selections 

Let us first confine our study to the special case where a pair of joined datasets R t 
and R 2 are restricted by two query windows w I and w 2 , respectively. Based on the 
extents of w 7 and w 2 we will try to identify the area that should be intersected by rec- 
tangles from each dataset in order to participate in the join. Thus, we will compute the 
query selectivity in three steps: (i) determine a number of candidate join objects from 
each dataset using w r w 2 , (ii) estimate the workspace area of the join and (iii) com- 
pute the join selectivity using the number of candidates, the estimated workspace area 
and the average rectangle extents according to Equation 2. 

Let w id be the projection of w. on dimension d, w ids and w lde be the starting and the 

ending point of w id , respectively, and w id be the length of w id . Consider also similar 
notations for the corresponding properties of the average extents of a rectangle y in 
dataset R. (e.g., y d is the average projection of s. on dimension d). We define two 



updated windows vv, , w 2 , as follows: 




= maxfvv, 


is’ W 2.d.f ^2 ,d } 


(6a) 


= minfw, 


d,e’ W 2,d,e+ S 2,d ! 


(6b) 


w 2As =ma \{w 2 


is’ W l.d.f ^1 ,d } 


(6c) 


w 2 .te = min{w 2i 


d,e’ W l.d.e+ S l,d } 


(6d) 



In order for a rectangle from R. to be candidate for the join and intersect w r it should 
intersect w. . For instance, consider the selection windows w,, w 2 and the updated ones 
w, , w 2 in Figure 2a. Object a, belonging to R, and intersecting w,, cannot participate 
in the join because it may not overlap some object from R , that intersects w 2 . On the 
other hand, object b that intersects vv ( is a potential query result. 

Intuitively, w I and w 2 define the area where the spatial join is restricted. The num- 
ber of rectangles that participate in the join from dataset R. is determined by the selec- 
tivity of the corresponding w i . Notice that the end points set by Equations 6a-d do not 
always define valid intervals, since the lower end point may be greater than the upper 
end point. This situation may arise when the actual windows do not intersect and 
there is a large distance between them. In this case (if w lds > w lde for some i, d), the 
length of the corresponding projection is negative, and the selectivity of w id is posi- 
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tive only when s id > w lds - w lde ; otherwise, the selectivity of the complex query is 
zero. 

So far, we have calculated the windows w, and w 2 that should be intersected by 
each rectangle from R I and R 2 , respectively, in the result. These windows can be used 
in combination with Equation 1 to determine the number of join candidates from each 
dataset. The next step is to estimate the workspace of the join c , i.e., the area where 
the query results lie. The rectangles from R. that may participate in the join are inside 

a window c ( , which is generated by extending w td with s id at both sides on each 

dimension. For instance, in Figure 2b we know that join candidates from R I are in- 
cluded in c r Since c, and c 2 do not cover the same area, we need to average them in 
order to acquire a unique rectangle c that reflects best the area where the spatial join 
is restricted. Figure 2c provides an example of this normalization. Formally: 



c d . s = max { w I ds - s l d , w 2 d 


, - S 2,d } - K M 


■ S l,d “ W 2,d,s + S 2,d 


(7a) 


c de = min{w Ide + s ld ,w 2d 


, r +S 2.d 1 + \ W l,c 


U + S l,d “ W 2.d,e " S 2,d 1^ 


(7b) 



Figure 3 shows some more one-dimensional examples with four representative win- 
dow configurations over one pair of datasets. In case 3, the workspace is non-empty, 
although the original windows do not intersect. In case 4, the updated windows w, , 
w 2 are invalid; a rectangle from R. should intersect both endpoints of the invalid win- 
dow w, in order to be a candidate join object. Flowever, the average length of a rec- 
tangle Sj R, is smaller than the distance of the endpoints of w, , thus the join is con- 
sidered to have zero selectivity. 

Given the join workspace c, selectivity can be computed using the existing formu- 
lae for spatial selections and joins. Summarizing, the output cardinality of a query 
that includes the spatial join of two datasets R r R 2 restricted by window queries w p 
w 2 , respectively, is estimated by the following formula: 

OC(R l ,R 2 ,w 1 ,w 2 ) = OC(R l ,w l ) OC(R 2 ,w 2 ) min 1, (8) 

d = 1 C d 





(c) join workspace c 



Fig. 2. Generation of join workspace 
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Fig. 3. Windows that define the selectivity of a complex pairwise join 



3.2 Selectivity of Multiway Joins with Selections 

Equation 8 can be extended for complex spatial queries that join n datasets, poten- 
tially restricted by spatial selections. Selectivity is again computed in three steps. At 
the first step the updated window w. is calculated for each dataset R r using the win- 
dows of the neighbors in the join graph Q. At the second step, depending on Q, the 
workspace area of each join edge or of the whole graph is computed. Finally, the 
multiway join selectivity is estimated using (i) the selectivity of w l for each R., (ii) the 
workspace area(s) and (iii) Equations 3, 4. 

The updated selection window vv, for each R. is estimated using the initial windows 
and the query graph. It turns out that the calculation process is not simple, since the 
update of a window vv, to vv, may restrict the already updated window vv of a neigh- 
bor R . This process is demonstrated in the example of Figure 4, where three datasets 
are joined by a chain query and three windows restrict the joined rectangles. Assume 
that we attempt to calculate the updated window vv, for each vv, using the following 
formulae: 



= max { w Us , max { w jds - S j d , Q.. = True } } (9a) 

w Ue = min{w We , min{w Jd + s Jd , Q v = True} } (9b) 

First vv, is restricted to vv, using w 2 and s, (Figure 4c). Then w 2 is restricted to w 2 
using vv,, Sj, w 3 and s 3 (Figure 4d). Observe that the left point end of w 2 has changed, 
and this change should be propagated to vv 7 (Figure 4e), which is shortened on the 
left side. In general, each change at a window should be propagated to all neighbor 
windows in the query graph. 

?CI | 1 v„'ft 1 MJ '^r _ ">& 1 

So ft i 1 i 1 i— i ^ i — i i 

So (ft I 1 I 1 I 1 ^ I 1 

(a) (b) three windows (c) update of to w, (d) update of w 2 to w 2 (e) second update of 

query 



Fig. 4. Selection window update propagation in a network of joined datasets 
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The problem of window updates is similar to achieving local consistency in constraint 
satisfaction problems [29]. Therefore, we use a variation of an arc consistency algo- 
rithm [17] to estimate the final vv for each R.. The algorithm first places all selection 
windows in a queue. If a dataset R. does not have a selection window we set vv. = r, 
i.e., the workspace of the datasets. Then the first window vv. from the queue is picked 
and updated according to the current windows of the neighbors. If there is a change in 
w f the windows of all neighbors not currently in the queue are inserted in it because 
the changes need to be propagated to them. The process continues until the queue is 
empty. The pseudocode of the algorithm is given in Figure 5. 

window_propagation (window w[]. Query Q[] [] ) 
initialize queue; 
for each Rf do 

if Wi does not exist then Wf = r; 
queue . insert (wt) ; 
while not queue . empty ( ) { 

w f = queue . getfirst () ; 
for each dimension d do 

w f ,d,s = max{w f , d , s , max{w jidjS - s jd , Q fj = True}}; 

w f , d ,e = min{w fidi e, min{w jidie + s jd , Q fj = True}}; 

if w f has changed 
for each j , Qfj = True do 
if Wj not in queue then queue . insert (Wj ) ; 

} 

Fig. 5. The window_propagation algorithm 

After the execution of the window propagation algorithm, each window vv, will be 
transformed to the minimum intersection window vv ( . Window vv. is determined by 
the end points of the most restricted window vv. on each dimension. The path connect- 
ing R. with R contains at most n-2 graph nodes (where n is the number of datasets 
involved in the query), meaning that the end points of the window w t can be adjusted 
at most n - 1 times per dimension. Thus, the worst case complexity of the algorithm is 
O (d n), and its computational overhead in the optimization process is trivial. 

The next step of the estimation process is to determine the workspace area c of the 
multiway join. This process is performed in a similar way as described in the previous 
paragraph. First the coverage area c t of each window query w\ is estimated by extend- 
ing vvj with ,sj d at both sides on each dimension. If the query is acyclic the selectivity 

of each query edge Q is normalized with respect to the corresponding pairwise join 
workspace. Therefore Equation 3 is modified as follows: 




iJ'-Qij =TRUE 1 




